What will be your mission?
At Sony Pictures, our mission is to connect relevant movie and TV series content to consumers and audiences across transactional, subscription and linear viewing platforms. We’re building data products, visualization tools and insights platforms that will predict audience preferences and behavior for library and new releases to drive growth of engagement and consumption of Sony produced content. You will be reporting to the Executive Director of Data Management. Sony produces franchise brands, such as Spiderman, Men in Black and Jumanji, through to TV franchises such as Breaking Bad, Blacklist and Better Call Saul.
Our Data team is responsible for aggregating millions of consumer data points to build best in class predictive models, consumer trends systems, recommendation engines and more!
This position is responsible for developing and supporting new and existing datasets and partnering with IT to deliver ETL scripts to support operational reporting, management decision support, and advanced analytics.
Who are you?
You have a strong analytical and technical background and believe that data profiling, quality control, data integration across disparate data sets are foundational in enabling and driving a culture where data is democratized, accessible and easily interpreted for Sony Pictures to be a consumer-centric data driven organization.
You will work in the Data Management team that is responsible for the profiling, UAT, development and management of the Sony data licenses and cloud-based data platforms and sandboxes. You will be part of a team that is instrumental in partnering with Business Intelligence Team to enable data usage across the consumer analytics team, strategic business partners and business users.
You are a data optimization expert who is driven to ensure data quality, consistency, integration, connectivity across big data sets. You will work with our data science team, our business intelligence team, our strategic analysts and our diverse users in the business. You will provide a data management service spanning from data ingestion to end-user lifecycle: Profile, ingest, connect, features engineering, model, prepare, store and disseminate data.
You are data curious and business savvy with a strong background and demonstrated ability to navigate and analyze large and diverse data sets and be able to structure data so that it is meaningful and compelling.
What will you be doing?
o Work closely with consumer / audience analytics teams and technology partners as part of a complete data and analytics lifecycle.
o Conduct data lifecycle activities: problem identification, hypothesis definition, data integration, data profiling, prototyping and experimentation, data schema design, development, testing and deployment
o Conduct a broad range of analytics data management and optimization functions including development, scheduling, job optimization, version control management, data quality management, and issue resolution.
o Develop and deliver attributes and deploying data preparation processes to be utilized for data warehouses/repositories/big data platforms, self-service reporting solutions, scorecards and dashboards on user-level audience and consumer data using analytical and visualization tools.
o Work with teams to understand business requirements - translate and use complex data sources that needs to be blended, cleaned, structured and enriched into a desired output for analysis and utilized to deliver efficient and quality reports/ dashboards using various BI tools
o Partner with analysts to optimize and improve data operational processes
o Support User Acceptance Testing to ingest new data sets and migrate data sets to cloud platforms for data sandboxes
o Promote and evaluate use of software development best practices (including automated unit testing, Continuous Integration, Continuous Deployment, Dev Ops)
o Execute version control, change management and continuous improvement protocols
o Collect and refine data from disparate data sources to support business requirements
o Partner with IT to schedule and optimize various ETL scripts
o Access and create various data sources to supplement existing data
o Troubleshoot and resolve data discrepancies, nuances and issues. Perform quality audits, identify data gaps / anomalies; conduct root cause analysis with recommended solutions, and track error resolution through completion
o Collaborate with strategic analysts and data scientists to use expanded data sets and query techniques
o Develop custom datasets and execute data extract queries to support ad hoc analyses and data science initiatives
o Create and implement datasets for to support management, analysis, dashboard reporting and advance analytics models needs
o Conduct ad hoc data mining and data profiling activities on new data
o Create documentation and logs for new and existing data sources including: drafting data dictionaries, providing data field definitions, developing common business data language, data nuances, data summaries and tracking aggregate tables and features engineering calculations
o Maintain data catalog with additions, changes and deletions to ensure most up-to-date information is available to other team members
o Present and communicate data summaries and presentations with the goal of data education
o Minimum 2 years working as a Data Analyst, Data Profiler, Business Analyst for a medium to large scale BI implementation
o Big Data concepts and common components including open source Spark (AWS) and in memory (RAM) analysis and multiple languages
o Basic Java Script (useful if building APPS – but Python substitute to this)
o Scala (relevant to streaming data)
o Python, R
o Data products – how to productionalize Data Science (“DS”) prototypes (build logic and components with scale)
o Understanding of data engineering (data building blocks)
o Advanced SQL, and/or Python scripting experience
o Strong technology background: ETL and/or SQL coding experience: IBM Data Stage and Pentaho
o Understanding of API – working and connecting with API (Software engineering)
o Understand file constructs such Json, xml, flat files (CSV text files)
o Understanding of .Net (C#, VB Script, Java script) OR Java application development
o Experience in trouble shooting and resolving issues data systems
o Experience designing high performing database queries / indexes and troubleshooting database performance issues
o Excellent written and verbal communication skills
o Ability to deliver in a fast-paced and goals-based environment with time-bound deliverables
o Skilled in collaboration, enjoys working with others and achieving results as a team
o Passion and demonstrated experience in delivering results and meeting business goals in a matrix and cross functional environment
o Strong Business Intelligence / Data Warehouse technical analysis skills, including ability to read and interpret data models and entity relationship diagrams, as well as to read, write, and analyze complex SQL statements and their output
o 2-3 years of hands-on experience with Teradata and Oracle database platforms and tools to analyze them (SQL Assistant, SQL Developer, etc.)
o Prior experience working with centralized infrastructure teams for data architecture and database platform support
o Strong proficiency with Microsoft data analysis tools such as Excel and Access
o Strong experience in at least one of the following business disciplines: Retail Analytics, Consumer Analytics, and Product Development
o Bachelor’s degree in Information Technology, Computer Science, or Computer-related degree
o 2-3 years of experience with data warehousing and business intelligence
o 2-3 years of experience with all phases of a software development lifecycle