Data Platform Engineer

Who we are:

Calico is a research and development company whose mission is to harness advanced technologies to increase our understanding of the biology that controls lifespan, and to devise interventions that enable people to lead longer and healthier lives. Executing on this mission will require an unprecedented level of interdisciplinary effort and a long-term focus for which funding is already in place.

Position description:

Calico is seeking a strong software / data engineer to become a founding member of a small, world-class team that develops, productionizes, and maintains Calico’s data platform, including its data warehouse and a code base of high-quality data processing, analysis, and visualization algorithms. Here you will work in close collaboration with some of the world’s best life scientists and enable them to achieve ground-breaking discoveries in human health. You will do so by creating something new in a company that combines the nimbleness of a startup with a firm financial footing.

As a member of the Data Platform team you will work closely with computational biologists, machine learning experts, and scientists to create and maintain a platform to derive insights from data produced by Calico, by Calico’s collaborators, and from publicly available sources. Relevant data will span multiple organisms (from yeast to human), scales (from entire organisms to molecules), data modalities (from physiology to sequencing to imaging to free text), and time scales (from single time points to continuous time series). The team will build and maintain a data warehouse that stores this data in an organized format with metadata and lets our scientists easily explore, analyze, and visualize it. The team will also help develop and maintain a repository of scalable, reusable tools for data processing and visualization, machine learning, and natural language processing.

As a founding member of this new team, you will be a key part of setting the vision on how a data platform can best provide a productivity multiplier to our biological and computational scientists as they work toward finding solutions to help improve human healthspan.

Position requirements:

  • 4+ years of experience as a software or data engineer
  • Excellent coding skills in at least one general-purpose programming language (C++, Python, Java, etc.)
  • Familiarity with large-scale data warehouses and data processing technologies
  • Track record of effective collaboration on complex projects involving cross-functional partners with very diverse backgrounds
  • Ability to contribute to system architecture design
  • Ability to work independently and deliver excellent results
  • A strong desire to help make the world a better place

Nice to have:

  • Experience in some subset of: machine learning, data visualization, data integration, front-end development
  • Some background in biology or health sciences
  • Familiarity with the databases and data types used by computational biologists
  • Experience with data curation in a scientific environment
  • Experience working with Google Cloud Platform and APIs