
Manifold Bio is hiring a

Senior Data Engineer

Boston
Posted 3 months ago

Job Description

Manifold Bio is a dynamic biotech company building a pipeline of targeted biologics using a novel in vivo-centric discovery approach. Our drug discovery engine is differentiated by massively parallel screening in vivo from the beginning of our discovery process. This unique platform is powered by a proprietary protein barcoding technology that allows multiplexed protein quantitation at unprecedented scale and sensitivity. We combine this and other high-throughput protein engineering approaches with computational design to create antibody-like drugs and other biologics. Our world-class team of protein engineers, biologists, and computational scientists is working together to aim the platform at therapeutic opportunities where precise targeting is the key to overcoming clinical challenges.

Position

Manifold Bio is seeking an exceptional Senior Data Engineer to join our growing team. In this role, you will work closely with the data engineering and ML teams and with computational and experimental scientists to build core infrastructure and extract insights from our Next Generation Sequencing (NGS) readouts; design database schemas and data processing methodologies; and build and support common infrastructure for pipeline execution, CI/CD, and observability.

Responsibilities

  • Select, evaluate and deploy the best tools for the modern data stack supporting automated task execution:
    • Loading and transforming data in Snowflake
    • Executing NGS pipelines developed by the computational biology team
  • Integrate data upload and processing using APIs from a variety of data sources
  • Download and process public data sets 
  • Manage Snowflake database infrastructure, providing robust, secure access for internal users and external customers
  • Build robust data validation and monitoring pipelines
  • Implement CI/CD pipelines for data management and NGS pipeline projects, including deployment of AWS infrastructure (IAM policies, Lambda functions, containers, etc.)
  • Design and implement experimental results schemas in Benchling as well as Snowflake, working with scientists on the biology teams
  • Develop data-driven visualization tools that allow users to access and interact with the data (Tableau, Plotly, HoloViz, or other Python-based technology)
  • Create an image processing pipeline to organize microscopy images and make them accessible

Required Qualifications

  • Master's degree in bioinformatics, computational biology, cell biology, or a similar field, or a BS with 5+ years of relevant experience
  • 7+ years of data engineering experience 
  • Fluency coding in Python and SQL
  • Fluency managing Snowflake databases and DBT models
  • Experience with translating and scaling local workflows onto AWS
  • Experience with at least one workflow system (ideally Apache Airflow, but could be Nextflow, Dagster, or AWS Batch)
  • Strong understanding of software engineering best practices

Preferred Qualifications

  • Deep knowledge of version control, test-driven development, and cloud computing
  • Experience working with workflow orchestrators/engines
  • Strong written and verbal communication skills for cross-functional collaboration
  • Nice to Have
    • Experience building and executing custom NGS workflows
    • Experience with a variety of orchestration systems
    • Experience with Tableau and other data visualization packages (Plotly, HoloViz)

This Role Might Be Perfect For You If

  • You thrive in collaborative environments, working with a diverse set of internal customers
  • You're energized by solving complex data organization and processing issues 
  • You enjoy building robust, production-quality tools that others rely on for critical decisions
  • You're passionate about the therapeutic potential of engineered antibodies and want to accelerate their development
  • You love working at the intersection of cutting-edge data processing and computational methods to support high-throughput biology



We value different experiences and ways of thinking and believe the most talented teams are built by bringing together people of diverse cultures, genders, and backgrounds.
