Senior/Lead Data Engineer (Python) IRC152716
Job: IRC152716
Location: Poland - Krakow
Designation: Senior Software Engineer
Experience: 3-5 years
Function: Engineering
Skills: AWS, Databases, Docker, Snowflake, SQL
Remote: Yes
Description:
GlobalLogic is partnering with a US-based digital health company whose mission is to connect people through health conversations and to build online health communities where members support each other and share knowledge about how to deal with their disease.
About client:
Our client is a fast-paced digital healthcare company that is building a growing portfolio of online health communities for people with chronic conditions. The company’s mission is to improve patients’ quality of life by connecting them with each other, with caregivers, and with healthcare industry partners, building beneficial social interactions and meaningful health conversations.
About project:
The core of the project is working with data; the goal is to redesign and enhance data-driven solutions. The team is responsible for expanding and optimizing the data flow and data pipeline architecture, modeling and analyzing data, interpreting trends or patterns in complex data sets, and translating them into product and marketing insights. If you are excited by the prospect of optimizing or even redesigning our data architecture to support our next generation of products and data initiatives, we would be thrilled to have you apply!
Requirements:
- Python
- AWS: SES, RDS (MySQL), EC2, Elasticsearch Service, Route 53, VPC, etc.
- Snowflake
- Docker
- dbt
- Airflow
Job Description
● Strong Python skills
● Advanced working knowledge of SQL, including query authoring, and experience with relational databases, as well as working familiarity with a variety of databases
● A successful history of manipulating, processing and extracting value from large disconnected datasets
● Experience building and optimizing data pipelines, architectures and data sets
● Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement
● Strong analytic skills related to working with unstructured datasets
Job Responsibilities:
● Manage and optimize core data infrastructure
● Develop custom data infrastructure not available off-the-shelf
● Build monitoring infrastructure to give visibility into the pipeline’s status
● Monitor all jobs for impact on cluster performance
● Run maintenance routines regularly
● Tune table schemas to minimize costs and maximize performance
● Build and maintain custom ingestion pipelines
● Build non-SQL transformation pipelines
● Build processes supporting data transformation, data structures, metadata, dependency and workload management
What We Offer
Exciting Projects: With clients across all industries and sectors, we offer an opportunity to work on market-defining products using the latest technologies.
Collaborative Environment: You can expand your skills by collaborating with a diverse team of highly talented people in an open, laid-back environment, or even abroad in one of our global centers or client facilities!
Work-Life Balance: GlobalLogic prioritizes work-life balance, which is why we offer flexible work schedules.
Professional Development: We develop paths suited to your individual talents through international knowledge exchanges and professional certification opportunities.
Excellent Benefits: We provide our employees with private medical care, sports facility cards, group life insurance, travel insurance, a relocation package, food subsidies and cultural activities.
Fun Perks: We want you to feel comfortable at work, which is why we create a good working environment with relaxation zones, host social and team-building activities, and stock our kitchen with delicious teas and coffees!