Located in Boston and the surrounding communities, Dana-Farber Cancer Institute brings together world renowned clinicians, innovative researchers and dedicated professionals, allies in the common mission of conquering cancer, HIV/AIDS and related diseases. Combining extremely talented people with the best technologies in a genuinely positive environment, we provide compassionate and comprehensive care to patients of all ages; we conduct research that advances treatment; we educate tomorrow's physician/researchers; we reach out to underserved members of our community; and we work with amazing partners, including other Harvard Medical School-affiliated hospitals.
The generation, management, and interpretation of molecular data across the Dana-Farber Cancer Institute (DFCI) and its collaborators across Brigham and Womens Hospital and Boston Childrens Hospital is critical for the advancement of precision oncology.
The Department of Informatics and Analytics (I&A) at Dana-Farber is seeking a Bioinformatics Engineer who is passionate about the potential for molecular data to inform clinical care of cancer patients and the groundbreaking discoveries that are a product of genomics and molecular data. This position will serve as the point person for the quality and availability of molecular testing data at DFCI. She/he will be focused on ensuring the highest of quality in our bioinformatics data that includes understanding and interrogating data for themselves as well as shepherding data to other systems for research and the enablement of precision medicine through the use of automation and data pipelines.
Manage and improve data ETL processes to integrate various types of internal and external molecular data into DFCI enterprise data warehouse (EDW)
Design and implement reusable and extensible data validation framework through structured data elements and schema
Review and enhance QC metrics to monitor and improve the pipeline execution, the data product quality, and enterprise molecular data integrity
Develop and maintain the automated test to ensure the successful pipeline execution and deployment of new features
Collaborate with I&A Bioinformatics and Data Science Group with cutting-edge Big Data technologies and Cloud ecosystem to leverage high-dimensional genomic data and reveal new scientific insights in cancer research and patient care
Evaluate different strategies and solutions for genomic data indexing, search, and retrieval
Serve as the liaison between the end users of enterprise genomic data and the development teams of EDW and software engineering; translate the users data requirements into specifications for EDW and software teams
Assist product management for the documentation of the ETL processes, QC metrics, data validation rules and unified genomic data dictionaries
Promote FAIR data principles, adopt the genomic data standards commonly used for research and clinical care, including but not limited to NCI Genomic Data Commons, ClinVar, dbVar, COSMIC, VICC, Sequence Ontology, and consortiums in genomic data curation and annotation
Manage relationships with other groups that share interdependencies on Dana-Farber molecular data
Bachelors degree required, MS or PhD preferred in Bioinformatics, Computational Biology, Data Science, Computer Science, or related discipline
Experience with Python required
Familiar with relational databases such as MySQL, Oracle, or similar
Prior experience with genomics or Next Generation Sequencing data preferred
Experience in working with data transformation pipelines and understanding of nuances of such pipelines
Experience with common cancer genetics databases is a plus (GDC, ClinVar, dbGaP, NCI GDC, COSMIC, etc.)
Detail oriented with ability and drive to gain deep understanding of technical systems
Ability to prioritize and manage various tasks and projects reliably and in a timely manner
Requires minimal direction from leadership and possesses the ability to adapt to new challenges as they arise
Excellent interpersonal skills, passionate about innovative solutions
Dana-Farber Cancer Institute is an equal opportunity employer and affirms the right of every qualified applicant to receive consideration for employment without regard to race, color, religion, sex, gender identity or expression, national origin, sexual orientation, genetic information, disability, age, ancestry, military service, protected veteran status, or other groups as protected by law.
Job ID: 2019-16341
External Company URL: www.dana-farber.org