Project Description
The Real-Time Cloud Site Reliability Engineers team is responsible for contributing to and reusing a set of standards-based Cloud patterns within a shared DevOps culture that will enable the Developers they partner with to accelerate the time-to-market of critical software applications to the Public Cloud. They will be contributing to the creation, testing, application and refinement of these Cloud patterns.
As part of the SDLC, the Real-Time Cloud Site Reliability Engineer will develop and automate all aspects of application reliability using common industry tools (Jenkins, GitLab, Concourse) and practices, increasing efficiency and reducing toil while improving reliability across all major CSPs.
In this role, the candidate is expected to be a highly proactive and knowledgeable Site Reliability Engineer, working with product, development, and QE to continually improve the operability, scalability, security and cost of every Public Cloud Real-Time application we deploy and manage.
Responsibilities
Technology (80%)
• Create and employ reusable Infrastructure as Code templates via Terraform and externalized configuration tools to construct complex Public Cloud applications that can be deployed to multiple environments.
• Automate monitoring, alerting, processes and efficiency improvements.
• Build Continuous Integration and Continuous Delivery pipelines that automate the deployment of applications and the monitoring and logging they require to Public Cloud.
• Coach Developers on cloud-native patterns and best practices that, when followed, will enable our partners within Reliability Engineering to simplify and scale their application support.
• Collaborate with and solicit feedback from Site Reliability Engineers, Application Delivery Engineers, Developers, Network Engineers, and Security Engineers to identify missing Cloud Delivery patterns to be built.
• Contribute to interactions with our Public Cloud Architecture team and with the wider Refinitiv technology community to improve the Public Cloud Platform services and tools we provide to Developers.
• Maintain an understanding of how industry trends and new technologies can be leveraged to meet or exceed our business objectives.
• Respond to and remediate incidents.
Leadership & Communication (15%)
• Collaborate with Product Engineering teams to translate high-level technical requirements into Infrastructure as Code.
• Build rapport with developers and develop a culture of mutual respect.
• Communicate effectively and professionally with internal customers, management and peers across the globe.
• Lead meetings with internal customers and effectively negotiate expectations.
• Capture and facilitate the sharing of metrics around the time-to-market and operability of the applications we deliver to Public Cloud.
Business Acumen and Strategy (5%)
• Develop a broad knowledge of Refinitiv business strategies and our product portfolio.
• Innovate on new and improved uses of technology to meet business objectives.
• Understand new technologies and their technical impact on supported systems.
• Collaborate with peers, architects and managers to understand strategy and how best to operationalize it across our cloud environments.
Skills
Must have
• 4-5 years of experience working with the full Software Delivery Lifecycle within complex enterprise environments
• 2-3 years of experience with modern Agile Development and DevOps methodologies
• 2-3 years of experience designing, building and automating the delivery of applications to Cloud Service Providers like AWS, Azure, GCP or Alibaba
• 1-2 years of experience designing, building, and automating infrastructure and app deployments on AWS
• 1-2 years of experience creating and deploying Infrastructure as Code templates to create app infrastructure declaratively and idempotently (e.g. Terraform, CloudFormation, Packer, etc.)
• 1-2 years of experience creating and managing CI and CD pipelines (e.g. Jenkins, CircleCI, GitLab, AWS CodePipeline) that incorporate automated testing (Unit, BDD, Integration, etc.)
• 1-2 years of experience using Docker to containerize applications and promote immutable infrastructure
• 1-2 years of experience using Configuration Management tools to configure infrastructure and apps once deployed (e.g., Puppet, SaltStack, Ansible, etc.) where containerization isn’t possible
• 2-3 years of experience in deploying and troubleshooting software applications on Linux
• 2-3 years of experience with at least one programming or scripting language (e.g. Python, Node, Java, C#, PowerShell, Bash, etc.)
• 1-2 years of experience in networking fundamentals, diagnostics and troubleshooting (DNS, firewalls, etc.)
• Proven ability to investigate complex issues spanning multiple technologies and drive them to resolution
• Excellent communication (written and verbal) and interpersonal skills
• 1-2 years of experience with Terraform configurations and modules, and with Terraform Enterprise
Nice to have
• Concourse
Languages
English: B2 Upper Intermediate
Seniority
Regular
Relocation package
If needed, we can help you with the relocation process.
To help us track our recruitment effort, please indicate in your cover letter where (vacanciesinukraine.com) you saw this job posting.
