It is in this hackthon I saw Hilary Mason, Claudia Perlich and many other interesting people. It is in this hackathon I appreciate some wonderful idea, such as the lyartist (an app help lyric writer), xbox visulization, and the app to help you stay from dangerous area of a city.
Our team analyze 550 indeed job post that are hiring data scientist and we got some interesting result.
Here are the top skills that need for a data scientist. Want to do data science? start to learn python first.
| python | 270 |
| r | 268 |
| sql | 258 |
| hadoop | 229 |
| java | 174 |
| processing | 157 |
| excel | 138 |
| c | 112 |
| matlab | 99 |
| c.. | 88 |
Another good news for newbie data scientist is that, company are not expected you know all about web developing, algorithm design, SQL and Non-SQL database management, hadoop family and scala. Those full stack data science is few.
Besides, the top 3 algorithm are Clustering, Sampling, Ridge Regression, Ada boosting. In a words, you do not need to know exact detail of complicated algorithm. Understanding and applying those simple algorithm are more important.
While what is the difference between the great and good data scientist? Claudia Perlich gave me the answer: data intuition. She compare data scientist to detective, great detective always had assumption then confirm or rejected their assumption by fact. Great data scientist must have sufficient understanding about human behavior and the industry.
Another idea that worth mention is there is no clean data, you had to embrace the randomness of the world. Good data is that truly reflect fact not the clean and well formatted ones.
Also I like those advice given by data scientist panel:
1. When you are young, do the job that you really like. Do not work for money.
2. Always keep writing.
3. Got some sleep after hackathon.

没有评论:
发表评论