Position Overview
Responsibilities
- Curate and validate omics metadata using structured workbooks and controlled vocabularies
- Run CLI-based workflows to upload metadata and processed data into databases
- Write SQL, shell, and Python scripts to validate, troubleshoot, and automate metadata tasks
- Acquire and map external datasets to internal metadata standards
- Coordinate with scientists, bioinformaticians, and data engineers to ensure metadata quality and usability
Mandatory Skills
- SQL (writing and debugging queries, joins, aggregations, data validation)
- Shell scripting (Bash, awk, sed, grep, automation)
- Python (scripting-level proficiency for data manipulation and file parsing)
- Experience with metadata, ontologies, or controlled vocabularies
- Life sciences background (genomics, transcriptomics, omics data)
- Experience with public omics repositories (NCBI GEO/SRA)