代写 python graph statistic software Airbnb Take Home Challenge for Data Scientist (Analytics)

Airbnb Take Home Challenge for Data Scientist (Analytics)
Overview of Take Home Challenge
You’ll have ​48 hours​ to work on your analysis of the challenge. Please​ read carefully ​of the instructions and understand the audience of the presentation. Your work will be graded according to the following ​key points​.
General Advice
● Analysis
Clear metric definitions and analytical approach is highly valued in this challenge. Uses statistics appropriately and dentifie limitations of the work.
● Business intuition
Identify opportunities in the market and ​propose reasonable solutions. Opportunity Sizing and high-level thinking are pretty important. We hope the see sensible approach and reasonable order of magnitude.
● Communication
Writes clearly and concisely. Visualization effectiveness is essential for Data Scientist.
● Data foundation
The foundation is data, including data loading, summary, cleaning and your coding.
Notes
● Format
○ You are free to use whatever tools you are most comfortable with to work through
the analysis.
○ Please save and attach your project in its entirety in​ one document, including
any slides. ​If you have any code that you produce, please include the code file
and zip all files in one folder.
○ Please​ ​do​ ​NOT​​ ​​include​ ​your​ ​name​ ​or​ ​email​ ​address​ ​in your submission.
● Confidential: ​Please ​don’t share or publish the data​.

TAKE-HOME​ ​CHALLENGE:​ ​Data​ ​Science​ ​-​ ​Analytics
Airbnb​ ​is​ ​a​ ​two​ ​sided​ ​marketplace​ ​which​ ​matches​ ​guests​ ​to​ ​hosts.​ ​The​ ​booking​ ​flow​ ​at​ ​Airbnb​ ​is​ ​as​ ​follows:​ ​a​ ​guest finds​ ​an​ ​available​ ​room​ ​(listing)​ ​that​ ​they​ ​like,​ ​and​ ​then​ ​they​ ​contact​ ​the​ ​host.​ ​Once​ ​the​ ​guest​ ​finds​ ​a​ ​listing​ ​they are​ ​interested​ ​in,​ ​there​ ​are​ ​three​ ​ways​ ​to​ ​send​ ​the​ ​host​ ​an​ ​inquiry:​ ​‘contact_me’,​ ​‘book_it’,​ ​or​ ​‘instant_book’ (detailed​ ​at​ ​the​ ​bottom​ ​of​ ​this​ ​document).​ ​Upon​ ​receiving​ ​the​ ​inquiry,​ ​the​ ​host​ ​can​ ​then​ ​decide​ ​whether​ ​or​ ​not​ ​to accept​ ​the​ ​request​ ​(for​ ​‘contact_me’​ ​and​ ​‘book_it’​ ​methods​ ​–​ ​`instant_book`​ ​is​ ​auto-accepted).​ ​One​ ​of​ ​our​ ​goals​ ​at Airbnb​ ​is​ ​to​ ​increase​ ​bookings​ ​on​ ​our​ ​platform.
Prompt:
You​ ​are​ ​the​ ​first​ ​data​ ​scientist​ ​to​ ​join​ ​a​ ​cross-functional​ ​Product​ ​and​ ​Operations​ ​team​ ​working​ ​to​ ​grow​ ​bookings​ ​in Rio​ ​de​ ​Janeiro.​ ​The​ ​team​ ​asks​ ​you​ ​for​ ​help​ ​with​ ​the​ ​following:
1. What​ ​key​ ​metrics​ ​would​ ​you​ ​propose​ ​to​ ​monitor​ ​over​ ​time​ ​the​ ​success​ ​of​ ​the​ ​team’s​ ​efforts​ ​in​ ​improving the​ ​guest​ ​host​ ​matching​ ​process​ ​and​ ​why?​ ​​ ​Clearly​ ​define​ ​your​ ​metric(s)​ ​and​ ​explain​ ​how​ ​each​ ​is computed.
2. What​ ​areas​ ​should​ ​we​ ​invest​ ​in​ ​to​ ​increase​ ​the​ ​number​ ​of​ ​successful​ ​bookings​ ​in​ ​Rio​ ​de​ ​Janeiro?​ ​What segments​ ​are​ ​doing​ ​well​ ​and​ ​what​ ​could​ ​be​ ​improved?​ ​​​ ​​Propose​ ​2-3​ ​specific​ ​recommendations​ ​(business initiatives​ ​and​ ​product​ ​changes)​ ​that​ ​could​ ​address​ ​these​ ​opportunities.​ ​Demonstrate​ ​rationale​ ​behind each​ ​recommendation​ ​AND​ ​prioritize​ ​your​ ​recommendations​ ​in​ ​order​ ​of​ ​their​ ​estimated​ ​impact.
3. There​ ​is​ ​also​ ​interest​ ​from​ ​executives​ ​at​ ​Airbnb​ ​about​ ​the​ ​work​ ​you​ ​are​ ​doing,​ ​and​ ​a​ ​desire​ ​to​ ​understand the​ ​broader​ ​framing​ ​of​ ​the​ ​challenge​ ​of​ ​matching​ ​supply​ ​and​ ​demand,​ ​thinking​ ​beyond​ ​the​ ​data​ ​provided. What​ ​other​ ​research,​ ​experiments,​ ​or​ ​approaches​ ​could​ ​help​ ​the​ ​company​ ​get​ ​more​ ​clarity​ ​on​ ​the problem
Your​ ​assignment:​​​ ​Summarize​ ​your​ ​recommendations​ ​in​ ​response​ ​to​ ​the​ ​questions​ ​above​ ​in​ ​a​ ​5-8​ ​slide presentation​ ​intended​ ​for​ ​the​ ​Head​ ​of​ ​Product​ ​and​ ​VP​ ​of​ ​Operations​ ​(who​ ​is​ ​not​ ​technical).​ ​Include​ ​an​ ​organized appendix​ ​sharing​ ​the​ ​details​ ​of​ ​your​ ​work​ ​conducted​ ​for​ ​the​ ​Rio​ ​team,​ ​that​ ​would​ ​be​ ​useful​ ​for​ ​the​ ​data​ ​team​ ​to understand​ ​your​ ​work.
Instructions:
1. Create​ ​a​ ​​PDF​​ ​of​ ​your​ ​presentation.
2. Append​ ​all​ ​code​ ​you​ ​use​ ​to​ ​analyze​ ​results​ ​to​ ​the​ ​​above​ ​PDF​,​ ​including​ ​code​ ​used​ ​for​
​data​ ​exploration.​ ​We typically​ ​see​ ​data​ ​processed​ ​in​ ​SQL/R/Python​ ​and​ ​a​ ​presentation​

​with​ ​results​ ​made​ ​in​ ​Keynote/Google slides/Powerpoint.​ ​But​ ​you​ ​are​ ​welcome​ ​to​ ​use​ ​any​ ​software​ ​you​ ​feel​ ​comfortable​ ​with.​ ​If​ ​you​ ​use​ ​Excel, please​ ​document​ ​the​ ​operations​ ​used​ ​to​ ​process​ ​the​ ​data,​ ​and​ ​append​ ​your​ ​spreadsheet.
3. Please​ ​do​ ​NOT​ ​include​ ​your​ ​name​ ​or​ ​email​ ​address​ ​on​ ​this​ ​PDF.
4. You​ ​will​ ​have​ ​48​ ​hours​ ​to​ ​complete​ ​the​ ​assignment.
Grading:
Your​ ​assignment​ ​will​ ​be​ ​judged​ ​according​ ​to:
1. The​ ​analytical​ ​approach​ ​and​ ​clarity​ ​of​ ​your​ ​graphs,​ ​tables,​ ​visualizations,
2. The​ ​data​ ​decisions​ ​you​ ​made​ ​and​ ​reproducibility​ ​of​ ​the​ ​analysis,
3. Strength​ ​of​ ​recommendations,​ ​prioritizations,​ ​and​ ​rationale​ ​behind​ ​those,
4. The​ ​narrative​ ​of​ ​your​ ​presentation​ ​and​ ​ability​ ​to​ ​effectively​ ​communicate​ ​to​ ​non-technical​
​executives,
5. How​ ​well​ ​you​ ​followed​ ​the​ ​directions.
Data​ ​Provided:
Contacts​​ ​-​​ ​contains​ ​a​ ​row​ ​for​ ​every​ ​time​ ​that​ ​an​ ​user​ ​makes​ ​an​ ​inquiry​ ​for​ ​a​ ​stay​ ​at​ ​a​ ​listing​ ​in​
​Rio​ ​de​ ​Janeiro.
● id_guest_anon​ ​-​​ ​id​ ​of​ ​the​ ​guest​ ​making​ ​the​ ​inquiry.
● id_host_anon​ ​-​​ ​id​ ​of​ ​the​ ​host​ ​of​ ​the​ ​listing​ ​to​ ​which​ ​the​ ​inquiry​ ​is​ ​made.
● id_listing_anon​ ​-​​ ​id​ ​of​ ​the​ ​listing​ ​to​ ​which​ ​the​ ​inquiry​ ​is​ ​made.
● ts_interaction_first​ ​-​​ ​UTC​ ​timestamp​ ​of​ ​the​ ​moment​ ​the​ ​inquiry​ ​is​ ​made.
● ts_reply_at_first​ ​​-​ ​UTC​ ​timestamp​ ​of​ ​the​ ​moment​ ​the​ ​host​ ​replies​ ​to​ ​the​ ​inquiry,​ ​if​ ​so.
● ts_accepted_at_first​ ​-​​ ​UTC​ ​timestamp​ ​of​ ​the​ ​moment​ ​the​ ​host​ ​accepts​ ​the​ ​inquiry,​ ​if​ ​so.
● ts_booking_at​ ​-​ ​UTC​ ​timestamp​ ​of​ ​the​ ​moment​ ​the​ ​booking​ ​is​ ​made,​ ​if​ ​so.
● ds_checkin_first​ ​​-​ ​Date​ ​stamp​ ​of​ ​the​ ​check​-in​ ​date​ ​of​ ​the​ ​inquiry.
● ds_checkout_first​ ​​-​ ​Date​ ​stamp​ ​of​ ​the​ ​check-​out​ ​date​ ​of​ ​the​ ​inquiry.
● m_guests​ ​​-​ ​The​ ​number​ ​of​ ​guests​ ​the​ ​inquiry​ ​is​ ​for.
● m_interactions​ ​-​​ ​The​ ​total​ ​number​ ​of​ ​messages​ ​sent​ ​by​ ​both​ ​the​ ​guest​ ​and​ ​host.
● m_first_message_length_in_characters​ ​-​​ ​Number​ ​of​ ​characters​ ​in​ ​the​ ​first​ ​message​ ​sent​
​by​ ​the​ ​guest,​ ​if​ ​a message​ ​was​ ​sent
● contact_channel_first​ ​-​​ ​The​ ​contact​ ​channel​ ​through​ ​which​ ​the​ ​inquiry​ ​was​ ​made.​ ​One​ ​of​
​{contact_me, book_it,​ ​instant_book}.​ ​*See​ ​bottom​ ​of​ ​page​ ​for​ ​more​ ​detail*
● guest_user_stage_first​ ​​-​ ​Indicates​ ​whether​ ​the​ ​user​ ​has​ ​made​ ​a​ ​booking​ ​before​ ​sending​
​the​ ​inquiry​ ​(“past booker”).​ ​If​ ​the​ ​user​ ​has​ ​not​ ​booked​ ​before,​ ​then​ ​the​ ​user​ ​is​ ​a​ ​new​ ​user.

Listings​​ ​-​​ ​contains​ ​data​ ​for​ ​every​ ​listing​ ​in​ ​the​ ​market
● id_listing_anon​ ​​-​ ​anonymized​ ​id​ ​of​ ​the​ ​listing
● room_type​ ​-​​ ​indicates​ ​whether​ ​the​ ​room​ ​is​ ​an​ ​entire​ ​home,​ ​private​ ​room,​ ​or​ ​shared​ ​room
● listing_neighborhood​ ​-​​ ​the​ ​neighborhood​ ​of​ ​the​ ​listing
● total_reviews​ ​-​​ ​the​ ​total​ ​number​ ​of​ ​reviews​ ​of​ ​the​ ​listing​ ​(at​ ​the​ ​time​ ​the​ ​data​ ​was​ ​pulled).
Users​​ ​-​​ ​contains​ ​data​ ​for​ ​every​ ​user
● id_user_anon​ ​​-​ ​anonymized​ ​id​ ​of​ ​user
● words_in_user_profile​ ​-​ ​the​ ​number​ ​of​ ​words​ ​in​ ​the​ ​“about​ ​me”​ ​section​ ​of​ ​the​ ​user’s​
​Airbnb​ ​profile​ ​(at the​ ​time​ ​of​ ​contact)
● country​ ​-​​ ​origin​ ​country​ ​of​ ​the​ ​user
Further​ ​Information:
There​ ​are​ ​three​ ​ways​ ​to​ ​book​ ​a​ ​listing​ ​on​ ​Airbnb:
1. contact_me​​ ​-​​ ​The​ ​guests​ ​writes​ ​a​ ​message​ ​to​ ​the​ ​host​ ​to​ ​inquire​ ​about​ ​the​ ​listing.​ ​The​ ​host​ ​has​ ​the​ ​option to​ ​1)​ ​pre-​approve​ ​the​ ​guest​ ​to​ ​book​ ​their​ ​place,​ ​or​ ​2)​ ​they​ ​can​ ​reject,​ ​or​ ​3)​ ​they​ ​can​ ​write​ ​a​ ​free​ ​text message​ ​with​ ​no​ ​explicit​ ​acceptance​ ​or​ ​rejection.​ ​If​ ​the​ ​host​ ​pre-​approves,​ ​the​ ​guest​ ​can​ ​then​ ​go​ ​ahead and​ ​click​ ​to​ ​make​ ​the​ ​booking​ ​(but​ ​is​ ​not​ ​obligated​ ​to).
2. book_it​ ​​​-​ ​The​ ​guest​ ​puts​ ​money​ ​down​ ​to​ ​book​ ​the​ ​place​ ​directly,​ ​but​ ​the​ ​host​ ​has​ ​to​ ​accept​ ​the reservation​ ​request.​ ​If​ ​the​ ​host​ ​accepts,​ ​the​ ​booking​ ​happens​ ​automatically.​ ​If​ ​you​ ​have​ ​used​ ​Airbnb before,​ ​this​ ​shows​ ​up​ ​as​ ​a​ ​button​ ​labeled​ ​“Request​ ​to​ ​book”.
3. instant_book​​ ​-​​ ​The​ ​guest​ ​books​ ​the​ ​listing​ ​directly,​ ​without​ ​any​ ​need​ ​for​ ​the​ ​host​ ​to​ ​accept​ ​or​ ​reject actively​ ​(it​ ​is​ ​auto​-accepted​ ​by​ ​the​ ​host).​ ​This​ ​shows​ ​up​ ​as​ ​a​ ​button​ ​labeled​ ​“Book”.
Note:​​ ​A​ ​host​ ​can​ ​opt-in​ ​to​ ​the​ ​`instant_book`​ ​feature.​ ​If​ ​a​ ​host​ ​does​ ​so,​ ​a​ ​guest​ ​can​ ​use​ ​the​ ​`contact_me`​ ​or `instant_book`​ ​channels​ ​for​ ​booking​ ​that​ ​particular​ ​listing,​ ​but​ ​cannot​ ​use​ ​the​ ​`book_it`​ ​functionality.​ ​Alternatively, if​ ​a​ ​host​ ​does​ ​not​ ​opt​ ​in,​ ​a​ ​guest​ ​can​ ​use​ ​the​ ​`contact_me`​ ​or​ ​`book_it`​ ​channels​ ​only.​ ​​ ​We​ ​suggest​ ​that​ ​you​ ​browse the​ ​Airbnb​ ​website​ ​and​ ​look​ ​at​ ​listings​ ​to​ ​see​ ​the​ ​different​ ​ways​ ​that​ ​you​ ​can​ ​message​ ​a​ ​host.