COTA

Building and improving ETL pipelines to enable enhanced cancer treatment

The client

Established in 2011 by a multidisciplinary team of medical professionals, engineers, and data experts, COTA is dedicated to bringing clarity to the often fragmented and hard-to-access landscape of real-world medical data. Leveraging unique technology, sophisticated analytical methods, and extensive knowledge, COTA structures intricate data to deliver a holistic view of cancer, thereby enhancing both treatment and scientific investigation.

Country

USA

Industry

Healthcare

Type

Scaleup

The product

COTA developed a suite of products for gathering, organizing, and analyzing medical data related to cancer. The software creates detailed representations of cancerous diseases, which are then used in various use cases aimed at ensuring patients receive the best possible care. The process involves sourcing quantitative medical data from various medical providers, organizing it, and then presenting it either via an advanced data platform or in the form of specialized reports.

Technologies

Python

JavaScript

PostgreSQL

BigQuery

Google Cloud

Mirth Sync

Mirth Connect

RhinoJS

Flyway

The challenge

Business

COTA was looking for data engineers to support its existing engineering team in building an ETL pipeline to transform and warehouse non-PHI clinical data. The challenge lay in the unique requirements of the software: the stack involved PostgreSQL, Google Cloud Platform, and NextGen Connect (Mirth), a niche solution in healthcare that’s based on interface programming.

Technology

Technical challenges in the project were related to improving the data intake pipeline and included:

  • creating services to aggregate and map data from diverse sources,
  • improving the performance of these services,
  • creating migration and cleanup routines to standardize data sets,
  • automating monitoring services to ensure data quality, accuracy, and consistency,
  • thoroughly testing the system to guarantee its reliability.
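
As an illustration of the first and fourth points, here is a minimal Python sketch of mapping records from differently shaped sources into one common structure and then running an automated quality check. It is not COTA's implementation: the Diagnosis record, the provider_a and provider_b source names, and all field names are hypothetical, chosen only to show the pattern.

```python
from dataclasses import dataclass
from datetime import date
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Diagnosis:
    """Hypothetical common record shape (an assumption, not COTA's schema)."""
    patient_ref: str      # de-identified reference, not PHI
    icd10_code: str
    diagnosed_on: date


def from_provider_a(row: dict) -> Diagnosis:
    # Hypothetical source A: short keys and ISO-formatted dates.
    return Diagnosis(
        patient_ref=row["pid"],
        icd10_code=row["dx"].strip().upper(),
        diagnosed_on=date.fromisoformat(row["dx_date"]),
    )


def from_provider_b(row: dict) -> Diagnosis:
    # Hypothetical source B: verbose keys and MM/DD/YYYY dates.
    month, day, year = (int(part) for part in row["DiagnosisDate"].split("/"))
    return Diagnosis(
        patient_ref=row["PatientKey"],
        icd10_code=row["Code"].strip().upper(),
        diagnosed_on=date(year, month, day),
    )


# One mapper per source keeps source-specific quirks out of shared code.
MAPPERS: Dict[str, Callable[[dict], Diagnosis]] = {
    "provider_a": from_provider_a,
    "provider_b": from_provider_b,
}


def aggregate(batches: Dict[str, List[dict]]) -> List[Diagnosis]:
    """Map every row to the common shape and drop exact duplicates."""
    seen = set()
    records = []
    for source, rows in batches.items():
        mapper = MAPPERS[source]
        for row in rows:
            record = mapper(row)
            if record not in seen:
                seen.add(record)
                records.append(record)
    return records


def quality_issues(records: List[Diagnosis], today: date) -> List[str]:
    """Return human-readable descriptions of basic data quality problems."""
    issues = []
    for record in records:
        if not record.patient_ref:
            issues.append(f"missing patient_ref: {record}")
        if record.diagnosed_on > today:
            issues.append(f"diagnosis date in the future: {record}")
    return issues
```

Keeping a separate mapper per source means a new provider can be added without touching the aggregation or monitoring code, and the quality check can run on a schedule against each new batch.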

Duration

2021 - ongoing

The solution

Within weeks, we selected engineers who matched the client's requirements, onboarded them to the project, and integrated them with COTA’s in-house engineering team.

Under the client’s management, the team has delivered measurable improvements to the ETL pipelines, including:

  • revamping COTA’s integration service, making it faster and more streamlined,
  • setting up a warehouse structure on Google Cloud to efficiently transform and securely store clinical data,
  • rigorously testing the system to ensure its reliability, which is critical in healthcare.

We continue our partnership with COTA, steadily growing the data engineering team and working on new projects.

Engagement Type

Dedicated development teams

Expertise

Data engineering

Results

We helped COTA find the right talent and, as a result, improve its ETL pipelines.

Working together since 2021

A total of 3 data engineers involved in the project

Measurable improvements to ETL pipeline build time and quality

Sudhakar Velamoor

VP Engineering, COTA

The Sunscrapers team has a great recruitment process, take feedback seriously and work on it diligently. They also helped find candidates who are willing to learn a different technology stack compared to what they had done before.
