Data Engineer

Job Category: Data Engineer
Job Type: Full Time
Job Location: United States

Company Overview

Gotham Technology Group, LLC is in the business of providing guidance and direction to IT professionals. With sales offices in Connecticut, New Jersey, and New York City, Gotham serves clients based throughout the Northeastern United States, and delivers goods and services across the globe. Gotham has been Certified™ as a Great Place to Work four years in a row. Visit the link below to view our company profile.

About the job

Job Title: Data Engineer

Our client, an innovative biotech company focused on advancing solutions through cutting-edge data-driven research is seeking a Data Engineer to join their team.

As they continue to enhance their data capabilities, the Data Engineer will support the design and optimization of data pipelines critical to research initiatives. This role will be instrumental in managing complex datasets, particularly in bioinformatics and protein analysis.

Role Overview:

As a Data Engineer, you will develop and optimize scalable ETL pipelines, ensuring seamless data extraction, transformation, and storage across multiple platforms. You will collaborate with cross-functional teams to structure data environments that enable efficient model training and similarity searches. The ideal candidate will have a strong background in cloud-based data engineering, with hands-on experience in Azure and Databricks environments.

Responsibilities:

  • Design, implement, and manage ETL workflows to process large-scale datasets, ensuring data integrity and accessibility.
  • Partner with bioinformatics and data science teams to develop data structures optimized for analysis, similarity searches, and model training.
  • Oversee data lake and warehouse management, ensuring high performance, accuracy, and consistency in cloud-based systems.
  • Define and implement data architecture solutions that align with business and research needs.
  • Ensure the secure handling and transfer of sensitive research data in compliance with regulatory and security guidelines.
  • Automate data processing workflows, enabling advanced analytics and data-driven insights.
  • Maintain thorough documentation and best practices for data management and pipeline development.

Qualifications:

  • Bachelor’s degree in Computer Science, Data Engineering, Bioinformatics, or a related field.
  • Experience building and maintaining ETL pipelines for large, complex datasets.
  • Proficiency in SQL and Python for data manipulation and pipeline automation.
  • Strong understanding of data modeling, architecture, and performance optimization techniques.
  • Ability to thrive in a collaborative, fast-paced environment.
  • Strong analytical skills with attention to detail.

Preferred Qualifications:

  • Familiarity with big data frameworks such as Apache
  • Experience supporting machine learning workflows through efficient data engineering.
  • Knowledge of bioinformatics tools and data analysis.
  • Understanding of compliance standards related to research data (e.g., HIPAA, GxP).
  • Experience with containerization tools such as Docker or Kubernetes.
  • Expertise in handling and processing datasets, including cross-matching and similarity search applicati

How to Apply:

APPLY

 

Apply for this position

Allowed Type(s): .pdf, .doc, .docx