A few words about the project
We’re looking for a Junior/Mid Data Engineer to join our team in Warsaw or remotely.
We are carrying out a project for our partner, a New York-based company. Our client is an innovative healthcare company, founded by a group of doctors and IT specialists, that helps people around the world fight cancer. Its IT solutions, built on the experience of leading doctors and oncologists, make it easier to plan a patient's path of care.
The recruitment process is short and well organised: one meeting with the CTO (about 1 hour).
The plan is to have at least 4 hours a day of overlap with the US East Coast time zone, so your working hours should be something like 10:00-18:00 or 11:00-19:00 Polish time.
You will be responsible for...
Owning data science project scoping, feasibility assessment, analytical planning, and delivery of analytical outputs for life science clients.
Working on data mapping, integration, ingestion, processing, and automation.
Working with real-world data to apply, and sometimes create, methodologies for appropriate analytics.
Combining domain knowledge in the oncology space with data science and analytics programming skills to draw actionable insights from the data.
Stack: Python, SQL, Pandas, NumPy, JavaScript, GCP, BigQuery, WildFly, Agile/Scrum, scikit-learn, Big Data, data lakes.
Taking part in afternoon meetings with the US team (once or twice per 2-week sprint).
Working hours (Polish time): 10:00-18:00 or 11:00-19:00.
What's important for us?
Must have:
At least 2 years of experience in a data-related IT field, such as data engineering, data development, data science, or another area of data operations.
Experience with Python and JavaScript
Solid data modeling knowledge and experience
Experience with data processing and pipelines
Strong knowledge of SQL
Proficiency in distributed version control systems such as Git
Good English skills (at least B2), sufficient to work in an international environment.
Readiness to join an occasional call after 18:00 (once or twice per 2-week sprint).
Nice to have:
An educational background in IT.
Understanding of healthcare data formats and standards (HL7, FHIR, ICD, SNOMED, CPT, LOINC)
Understanding of infrastructure automation
Effective time management based on priorities dictated by the business
Service orientation toward customers (both internal and external)