New York, New York

  Data Engineering



About the Company
Our client is enabling next-generation clinical trials through data-driven products, including their flagship product. It is a tech-enabled risk-based monitoring solution and our second product, a clinical operations benchmarking tool. This tool combines and delivers real-time data from various trial data sources, predicts issues during clinical development to reduce trial risks, and empower monitoring teams by visualizing critical study data. The products empower clinical teams to benchmark clinical trial performance and analyze vital indicators of trial success and failure

About the Role
How often are you given the opportunity to be a part of the team from the very beginning, with an abundance of resources at your disposal; to be part of a team of people accomplished in both scientific and engineering disciplines, focused on using the best of technology to address complex, problems that have a positive impact on millions of people lives? Well, this is that role!

Our client is looking for thoughtful, hands-on technology aficionado with a strong aptitude for data engineering to join their rapidly growing team in their New York City office. The Data Engineer will work closely with the development operations engineers, and data scientists front-end developers, back-end developers. They use a platform that is fully cloud-based and is being built around modern tools and frameworks in an incredibly fast-moving environment.

Key Responsibilities

  • Design, develop, and implement data infrastructure and pipelines that ingest and transform data from various external sources, storing it in highly optimized database systems, and making it useful to our application and reporting layers
  • Create automation systems and tools to configure, monitor, and orchestrate data infrastructure and pipelines
  • Create data integration services to help onboard new customers as quickly as possible
  • Maintain ongoing reliability, performance, and support of the data infrastructure, providing solutions based on application needs and anticipated growth
  • Participate in creating and maintaining strict compliance, data privacy and security measures
  • Develop robust and production-level code to implement new product features in collaboration with other engineers and subject matter experts
  • Identify and resolve performance and scalability issues, troubleshoot problems, and improve product quality
  • Collaborate with the Front-End Development team to thread the right information through to forward-facing applications
  • Interface with the Development Operations colleagues to evaluate and implement methodologies and workflows to facilitate the frequent and continuous release of high-quality software
  • Work closely with Data Science team to implement descriptive and predictive algorithms and models using the latest technologies
  • Keep up to date on emerging technology solutions, particularly those on AWS, for continuous improvements in data engineering
  • Help recruit highly capable engineers to the team from diverse backgrounds
  • Mentor and be mentored by engineers of varying experience levels and subject matter areas.

Minimum Requirements

  • 3+ years relevant experience with data engineering
  • Strong proficiency with Python and SQL
  • Experience with AWS, S3, EC2, EMR, or an similar cloud-hosted infrastructure
  • Experience with cloud-hosted database/data warehouse architecture (Snowflake, Redshift, etc.)
  • Experience writing complex data transformations in SQL or related frameworks ( dbt)
  • Interest in building distributed computing and orchestration frameworks  usibng Airflow
  • Experience working in an Agile software development environment
  • Exceptional written and verbal communication skills
  • Strong attention to detail and highly organized, with effective multi-tasking and prioritization skills
  • Proactive and self-directed, with the ability to learn quickly
  • Comfortable with ambiguity
  • Strong problem-solving and troubleshooting skills
  • Ability to work as part of a collaborative cross-functional team in a fast-paced environment
  •  Interest in working at a rapidly changing start-up and scaling with the company as we grow
  • Bachelor’s degree with strong academic performance in Computer Science, Software Engineering, Applied Science, or equivalent field

Preferred Qualifications

  • Experience building and deploying large-scale data processing pipelines
  • Experience integrating data from disparate data sources
  • Experience with healthcare data, ideally clinical/operational clinical trial data
  • Knowledge of clinical data standards (e.g. CDISC, FHIR, HL7, etc.)
  • Knowledge of e-clinical systems and technologies (e.g. EDC, CTMS, IRT, etc.)

If you are interested, please apply or send a resume to

related jobs