Senior Data Engineer


Our mission is to make biology easier to engineer. Ginkgo is constructing, editing, and redesigning the living world in order to answer the globe’s growing challenges in health, energy, food, materials, and more. Our bioengineers make use of an in-house automated foundry for designing and building new organisms.


We're creating the codebase, compiler, and debugger for biology. We have built a strong set of internal software tools, automation, and processes that enable high-throughput genetic engineering across multiple species. We want to make them better, more powerful, more scalable, and more effective, while making them easier to use, manage, and deploy.


As a Senior Data Engineer, you’ll join in architecting our platform to support analytics and machine learning that will ultimately help to define how our bioengineering is performed at scale.  Ginkgo's programming languages of choice are Python and SQL, and DNA, but you are someone who loves writing elegant code in any language.  Plus, you're an experienced data wrangler who enjoys building systems from the ground up. Most importantly, you will be passionate about making biology the next engineering discipline.


Note: The current list of tools we utilize includes RDS Postgres, Snowflake, Airflow, AWS DMS, Spark on EMR, and Python. Extensive experience with the tools we use is not required, but rather a working understanding of the Desired Software and Tools listed below is preferred.


Desired Software and Tools Working Knowledge

*Data pipeline and workflow management tools: Airflow, Luigi, etc.

*Big Data tools: Snowflake, Hive, Spark.

*AWS cloud services: EC2, EMR, RDS, Redshift, S3.

*Languages: Python, Java, Scala, etc.




*Expanding and optimizing our data pipeline architecture, as well as flow and collection for cross functional teams. This includes: automating manual processes, ETL, re-designing infrastructure for greater scalability, and improving reliability and accuracy.

*Supporting our software engineering initiatives to ensure optimal delivery architecture is consistent throughout on-going projects.

*Using appropriate tools to analyze the data pipeline and provide actionable insights into operational efficiency, data accuracy, and other KPI’s.

*Working with various stakeholders to assist with related technical issues and infrastructure needs.

*Keeping our data secure.

If remote, must be able to start workday at 10am eastern standard time.


Desired Experience and Capabilities

*BS, MS, or PhD in computer science or related quantitative field

*5+ years of data engineering experience, with advanced knowledge of database design best practices

*Experience working with relational databases, data warehouses, and big data platforms.

*Demonstrated ability performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.

*Strong analytical skills in relation to working with large datasets.

*Experience building processes that support data transformation, data structures, metadata, dependency, and workload management.

*Working knowledge of message queuing, stream processing, and highly scalable big data stores.


*Analytical, highly motivated self-starter, with strong project management and organizational skills.


We also feel that it’s important to point out the obvious here – there’s a serious lack of diversity in our industry, and that needs to change. Our goal is to help drive that change. Ginkgo is deeply committed to diversity, equity, and inclusion in all of its practices, especially when it comes to growing our team. Our culture promotes inclusion and embraces how rewarding it is to work with people from all walks of life.  


We’re developing a powerful biological engineering platform, so we must remain mindful of the many ways our technology can – and will – impact people around the world. We care about how our platform is used, and having a diverse team to build it gives us the best chance that it’s something we’ll be proud of as it continues to grow. Therefore, it’s critical that we incorporate the diverse voices and visions of all those who play a role in the future of biology.


It is the policy of Ginkgo Bioworks to provide equal employment opportunities to all employees and employment applicants.

About Ginkgo Bioworks:

Ginkgo Bioworks was founded in 2008 with the mission to make biology easier to engineer. Engineering biology isn’t easy. It is frustratingly, painfully difficult. It’s programming without a debugger, manufacturing without CAD, and construction without cranes. At Ginkgo we are building a team that can build debuggers, write CAD, and operate cranes. We are looking for the best engineers, scientists, and hackers.

Other Jobs from Ginkgo Bioworks

Ginkgo Bioworks

Capture Manager-Concentric by Ginkgo

Multiple Locations
Full Time

Ginkgo Bioworks

Accounts Payable Operations Manager

Multiple Locations
Full Time

Ginkgo Bioworks

DevOps Product Manager

Boston, MA
Full Time

Ginkgo Bioworks

Data Science Manager

Multiple Locations
Full Time

Ginkgo Bioworks

Senior Data Engineer

Full Time

Help us improve SACC by providing feedback about this job: Report an issue