HireStarter's client is a mission-driven startup looking to stop the spread of infectious disease. This is an opportunity to become a foundational member of the engineering team and to design, build, and scale data pipelines, a data warehouse, and machine learning infrastructure. You will be a key contributor in designing and building our data platform and delivering robust data pipelines that will ultimately have a meaningful impact on an important social mission. This role offers a flexible work environment.
Designing, building, and deploying efficient data pipelines.
Intelligently designing and implementing our data architecture.
Implementing comprehensive data quality checks.
Providing data-driven insights.
Meeting data privacy and data security standards.
Securely sourcing external data from multiple partners.
5+ years of data engineering experience building data warehouses and data pipelines.
Experience building large-scale, data-driven applications, including elements like real-time streaming, batch data aggregation, data modeling, data cleaning, anomaly detection, and bulk ingestion.
Experience designing and writing robust ETL jobs.
Experience with distributed data processing systems (Hive, Spark, Hadoop, etc.).
A passion for problem solving and providing solutions.
Strong software development skills in at least one of the following: Python, Java, or Scala.
Extensive experience with SQL.
Experience with AWS (EC2, S3, EFS, RDS, DynamoDB, Lambda, Redshift, Kinesis).
Strong technical leadership skills.