• Watch Interview of Chairman - JumpStart Pakistan
  • Post A Free Job

SRE Specialist

Job Description

Role: Senior Site Reliability Engineer

Exp: 7 to 15 yrs

Location: Greater Noida

Please apply if interested or share your resume at [HIDDEN TEXT]

Job Description -

The ideal candidates should have advanced coding skills in Python, Shell and YAML, preferably with a minimum of 3-5 years of experience in all of these or similar languages.

Candidates should have 7+ years experience in SRE and either or both of the following roles: DevOps, Software Engineering, leveraging automation extensively to achieve key deliverables.

The role of Sr. Site Reliability Engineer is to support and enforce reliability elements into technological solutions that deliver an exceptional customer experience.

As part of Site Reliability Engineering team, you'll leverage your development background to promote a framework which will deliver optimal levels of performance and reliability throughout systems and services

Independently designs, implements, productionizes and maintains site reliability guidelines, processes and systems

Service Level Definition, Configuration and Measurement:

Define SLIs, SLOs & SLAs specific to each application or system:

Configuration of monitoring & alerting tools suitable for each product and/or platform team

Measure reliability & resilience (through pre-defined SLIs & SLOs) utilizing monitoring/alerting tools to drive continuous improvement based on data analysis

Incident Management

Facilitation of incident response through the engagement of various teams and stakeholders, while providing robust communication and visibility to the organization during service interruptions

Provide Root Cause Analysis for failures

Experience with a modern incident management platform to effectively drive incident response and problem resolution

Monitoring & Alerting

Debug defects as well as develop dashboards using modern monitoring tools (e.g. New Relic, Splunk, AIOPs) to enable a reduction in mttd (detection time) & mttr (resolution time)

Build monitors and alerts designed to manage SLAs, optimize performance, and minimize outages

Construct E2E customer journey dashboards and alerts for customized transactions and applications

Automates reliability requirements into system and application implementations and updates; including the implementation of self-healing solutions (ansible, terraform, etc).

Work with product management team to contribute to -

i) The identification of reliability features & requirements

ii) Level of effort estimates

for more details, contact us at

kalpana.singh@coforgetech.com

Apply For This JOB
Industry :
Functional Area :
Engineering - Software
Location :
NOIDA , INDIA
Salary :
Market Competitive
Gender :
Any Gender
Work Type :
Full Time
Age :
20-30
Education :
Graduate
Years of Experience :
2-15
Apply By :
31 of Aug 2024

   Your application has been submitted successfully

More jobs from Coforge
Loading Results