-
Unique ID:
#600
-
Price:
$
-
Location:
-
Posted on:
23rd of October 2013 at 1:30 AM
-
Expires in:
Expired
Data Scientist – 05552 (san mateo)
The Data Scientist is a member of a team collaborating on the design and implementation of big data analysis solutions for Sony services. This role requires strong analytical capability, excellent programming skills and the ability to communicate the analysis output clearly. The data scientist will work with local and remote engineering and business teams across multiple time zones to extract meaningful information from a wide variety of data collected from clients and online services.
Candidate needs to have excellent written and verbal communication skills and be a team player. Some international travel may be required.
Responsibilities:
•Collaborate with other data scientists and engineers in creating algorithms and heuristics to extract information from large data sets
•Implement algorithms in software using R and other languages
•Optimize implementations for high performance on large data sets
•Collaborate with engineering and operations teams on scaling and enhancing data analysis systems
•Analyze logs from online services
•Work with business teams on requirements and goals for analysis output
•Generate visualizations of the analysis results and present to engineering and business stakeholders
•Document all work and deliverables
•A Master’s degree in Computer Science, Mathematics or equivalent; a strong statistics background is necessary
•Demonstrable skill in implementing statistical algorithms in R
•Skilled in development with SQL databases (PostgreSQL, Oracle, etc.) and No-SQL data processing solutions (primarily Hadoop HBase)
•Capable of creating visualizations of data including tables, histograms, charts, data clouds, heat maps, etc
•
At least two years experience in data scientist role
Preferred Education / Skills:
•Doctorate in computer science, mathematics or similar field desired
•Some experience in general service programming (Java, Python, etc.)
•Experience with large data set handling (100s of terabytes or petabyte scale of data) collected from millions of users/clients
Apply •Compensation: Open