Rozwiązania technologiczne
Rozwiązania technologiczneW świecie innowacji każdy pomysł ma znaczenie. Nawet drobne idee mogą prowadzić do rewo...
Czy wiesz, że za pomocą kodu można skomponować symfonię, a sztuczna inteligencja potraf...
SANTA CLARA, Kalifornia, 10.01.2025 – GlobalLogic, spółka należąca do Grupy Hitachi i l...
Hitachi Cyber i GlobalLogic otwierają nowoczesne Centrum Operacji Bezpieczeństwa (SOC) ...
Consultant
Engineering
5-10 years
India - Bangalore
AWS, bash scripting, Cloud Infrastructure, Deployment, DevOps Practices, Grafana, Incident Management, Infra as Code, Jenkins build automation tool, Jenkins Pipelines, Monitoring and Logging, Python, Systems Architecture
Hybrid
About the Role
We are looking for a highly skilled and experienced Senior Site Reliability Engineer (SRE) to join our team and play a key role in building and scaling the infrastructure of an advertising platform. The ideal candidate will have a strong background in system design, automation, CI/CD, monitoring, capacity planning, and cloud infrastructure (AWS) — with a passion for creating reliable, scalable, and highly available systems.
Required Skills & Qualifications
8+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
Strong programming and scripting skills in Python, Go, Bash (or similar), with a focus on automation and tooling.
Expertise in CI/CD pipelines (Jenkins or similar) and infrastructure-as-code (Terraform, CloudFormation).
Hands-on experience with AWS services (EC2, RDS, S3, VPC, IAM, CloudWatch, etc.) for infrastructure design and operations.
Proficiency in Prometheus (or other monitoring/alerting systems) and incident management practices.
Solid understanding of system design, distributed systems, and large-scale architecture.
Strong background in capacity planning, performance tuning, and load testing.
Excellent problem-solving, communication, and collaboration skills.
Key Responsibilities
System Design & Architecture
Design, build, and maintain scalable, resilient, and highly available infrastructure and services for our’s advertising platform.
Collaborate with engineering teams to ensure new products and features are built with reliability, scalability, and performance in mind.
Implement redundancy, failover strategies, and automated recovery mechanisms to minimize downtime and enhance service reliability.
Leverage AWS services (e.g., EC2, RDS, S3, Lambda, VPC, IAM) to design and optimize infrastructure.
Automation & Tooling
Develop automation frameworks and tools to improve CI/CD pipelines, infrastructure provisioning, and operational workflows.
Leverage strong programming and scripting skills (Python, Go, Bash) to build scalable automation solutions, reducing manual intervention.
Drive initiatives for end-to-end automation, optimizing efficiency and reducing human error.
Monitoring & Incident Management
Implement and maintain robust monitoring systems (e.g., Prometheus, Grafana) with real-time alerting on key system metrics (latency, availability, etc.).
Lead incident response, troubleshooting, and root cause analysis, ensuring learnings are captured through post-mortem reviews.
Collaborate with support and engineering teams to reduce MTTR (Mean Time to Recovery) and prevent recurring issues.
Performance Optimization & Capacity Planning
Analyze system performance and recommend improvements for latency, throughput, and cost optimization.
Conduct capacity planning and load testing to ensure infrastructure can handle growth and peak traffic demands.
Identify and eliminate bottlenecks to improve reliability and efficiency.
Collaboration & Knowledge Sharing
Work closely with engineers, product managers, and stakeholders to align system reliability with business goals.
Document best practices, system designs, and incident response procedures to improve team efficiency and knowledge sharing.
Mentor and provide technical guidance to junior engineers, promoting a culture of continuous learning and improvement.
Culture of caring. At GlobalLogic, we prioritize a culture of caring. Across every region and department, at every level, we consistently put people first. From day one, you’ll experience an inclusive culture of acceptance and belonging, where you’ll have the chance to build meaningful connections with collaborative teammates, supportive managers, and compassionate leaders.
Learning and development. We are committed to your continuous learning and development. You’ll learn and grow daily in an environment with many opportunities to try new things, sharpen your skills, and advance your career at GlobalLogic. With our Career Navigator tool as just one example, GlobalLogic offers a rich array of programs, training curricula, and hands-on opportunities to grow personally and professionally.
Interesting & meaningful work. GlobalLogic is known for engineering impact for and with clients around the world. As part of our team, you’ll have the chance to work on projects that matter. Each is a unique opportunity to engage your curiosity and creative problem-solving skills as you help clients reimagine what’s possible and bring new solutions to market. In the process, you’ll have the privilege of working on some of the most cutting-edge and impactful solutions shaping the world today.
Balance and flexibility. We believe in the importance of balance and flexibility. With many functional career areas, roles, and work arrangements, you can explore ways of achieving the perfect balance between your work and life. Your life extends beyond the office, and we always do our best to help you integrate and balance the best of work and life, having fun along the way!
High-trust organization. We are a high-trust organization where integrity is key. By joining GlobalLogic, you’re placing your trust in a safe, reliable, and ethical global company. Integrity and trust are a cornerstone of our value proposition to our employees and clients. You will find truthfulness, candor, and integrity in everything we do.
GlobalLogic, a Hitachi Group Company, is a trusted digital engineering partner to the world’s largest and most forward-thinking companies. Since 2000, we’ve been at the forefront of the digital revolution – helping create some of the most innovative and widely used digital products and experiences. Today we continue to collaborate with clients in transforming businesses and redefining industries through intelligent products, platforms, and services.