Senior Data Engineer

Bangalore | 5-6 years

Reports to: Lead Data Engineer 

  • Create and maintain optimal data pipeline architecture ETL/ ELT into Structured data 
  • Assemble large, complex data sets that meet functional / non-functional business requirements and create and maintain  multi-dimensional modeling like Star Schema and Snowflake Schema, normalization, de-normalization, and joining of datasets.
  • Expert-level experience creating Fact tables, Dimensional tables, and ingesting datasets into Cloud-based tools. Job Scheduling  and automation experience is a must. 
  • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re designing infrastructure for greater scalability, etc.
  • Set up and maintain data ingestion, streaming, scheduling, and job monitoring automation. Connectivity between the Data  factory, Blob storage, SQL, and Power BI needs to be maintained for uninterrupted automation. 
  • Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data  sources using SQL and “big data” technologies on Azure 
  • Build analytics tools that utilize the data pipeline to provide actionable insight into customer acquisition, operational  efficiency, and other key business performance metrics 
  • Work with cross-functional teams including external consultants and IT teams to assist with data-related technical issues and  support their data infrastructure needs 
  • Create data tools for analytics and data scientist team members that assist them in building and optimizing our product into  an innovative industry leader.

Required Skills: 

  • Hands-on experience in data warehousing (Synapse or any OLAP) to support business/data analytics, business intelligence (BI)
  • Advanced knowledge of SQL and experience working with relational databases, query authoring (SQL) as well as working  familiarity with a variety of databases and Cloud Data warehouses 
  • Have a thorough understanding and experience in building and implementing scalable Machine Learning frameworks ready to  be consumed by Microservice Architecture.  
  • Deep understanding of foundational math associated with machine learning such as linear algebra, numerical optimization,  probabilistic models, and statistics. 
  • Data Model development, additional Dims and Facts creation, and creating views and procedures, enable programmability to  facilitate Automation 
  • Experience in data compression to improve processing and finetuning SQL programming skills required
  • Experience building and optimizing “big data” data pipelines, architectures, and data sets 
  • Experience performing root cause analysis on internal and external data and processes to answer specific business questions  and identify opportunities for improvement 
  • Strong analytic skills related to working with structured and unstructured datasets 
  • Experience with manipulating, processing, and extracting value from large unrelated datasets 
  • Working knowledge of message queuing, stream processing, and highly scalable “big data” stores
  • Strong analytical and problem-solving skills to be able to structure and solve open-ended business problems (pharma  experience is highly preferred)

Experience: 4-8 years of professional work experience in data warehousing

Industry Type: Pharmaceutical

Education: Bachelor’s degree in Computer Science, Software or Computer Engineering

Location: Bangalore