Receive alerts when this company posts new jobs.
Senior Data Engineer
at CVS Health
Participate in the design, build, and management of large scale data structures and pipelines and efficient Extract/Load/Transform (ETL) workflows. Develop large scale data structures and pipelines to organize, collect and standardize data that will help generate insights and address reporting needs. Write ETL (Extract / Transform / Load) processes, design database systems and develop tools for real-time and offline analytic processing. Develop systems that process, store, and serve data for use by others. Maximize the utilization of data to generate insights and address business needs. Use statistical predictive modeling to evaluate scenarios and make predictions on future outcomes in order to solve highly complex problems and support decision making. Analyze very large data sets in real time databases and develop and implement mathematical approaches. Work alongside data science team to transform data and integrate algorithms and models into automated processes. Utilize Hadoop architecture and HDFS commands, and design and optimize queries to build data pipelines. Utilize Python, Java, Hive, Cassandra, Pig, MySQL or NoSQL to build robust data pipelines and dynamic systems. Build data marts and data models to support Data Science and other internal customers and integrate data from a variety of sources, assuring that they adhere to data quality and accessibility standards. Analyze current information technology environments to identify and assess critical capabilities and recommend solutions. Experiment with available tools and advise on new tools in order to determine optimal solution given the requirements dictated by the model/use case.
Bachelor’s degree in Computer Science, Computer Engineering, or a related field.
Minimum five years of experience: designing and building ETL Data Pipelines to load data into data warehouses. Prior experience must include: utilizing data structures and data processing (searching/sorting) algorithms; participating in technical solutions (HLD/LLD) design sessions; writing complex SQL/Hive queries to perform transformations of large datasets; conducting data profiling and analyzing results for reporting requirements; developing ETL Data pipelines using ETL tools and frameworks including Apache-Spark, Informatica Power center 9.x, Python, and SSIS to extract data from sources including Flat files, XML files, Json, IBM MQ Sources, and relational databases including Oracle, DB2 and MS SQL Server; and analyzing and resolving defects in production data pipelines.
Percent of Travel Required
Aetna, a CVS Health company, we are joined in a common purpose: helping people on their path to better health. We are working to transform health care through innovations that make quality care more accessible, easier to use, less expensive and patient-focused. Working together and organizing around the individual, we are pioneering a new approach to total health that puts people at the heart.
We are committed to maintaining a diverse and inclusive workplace. CVS Health is an equal opportunity and affirmative action employer. We do not discriminate in recruiting, hiring or promotion based on race, ethnicity, gender, gender identity, age, disability or protected veteran status. We proudly support and encourage people with military experience (active, veterans, reservists and National Guard) as well as military spouses to apply for CVS Health job opportunities.