- Job Search
- IT Jobs
- SRE Specialist
Similar Jobs
SRE Specialist
Job Description
Role: Senior Site Reliability Engineer
Exp: 7 to 15 yrs
Location: Greater Noida
Please apply if interested or share your resume at [HIDDEN TEXT]
Job Description -
The ideal candidates should have advanced coding skills in Python, Shell and YAML, preferably with a minimum of 3-5 years of experience in all of these or similar languages.
Candidates should have 7+ years experience in SRE and either or both of the following roles: DevOps, Software Engineering, leveraging automation extensively to achieve key deliverables.
The role of Sr. Site Reliability Engineer is to support and enforce reliability elements into technological solutions that deliver an exceptional customer experience.
As part of Site Reliability Engineering team, you'll leverage your development background to promote a framework which will deliver optimal levels of performance and reliability throughout systems and services
Independently designs, implements, productionizes and maintains site reliability guidelines, processes and systems
Service Level Definition, Configuration and Measurement:
Define SLIs, SLOs & SLAs specific to each application or system:
Configuration of monitoring & alerting tools suitable for each product and/or platform team
Measure reliability & resilience (through pre-defined SLIs & SLOs) utilizing monitoring/alerting tools to drive continuous improvement based on data analysis
Incident Management
Facilitation of incident response through the engagement of various teams and stakeholders, while providing robust communication and visibility to the organization during service interruptions
Provide Root Cause Analysis for failures
Experience with a modern incident management platform to effectively drive incident response and problem resolution
Monitoring & Alerting
Debug defects as well as develop dashboards using modern monitoring tools (e.g. New Relic, Splunk, AIOPs) to enable a reduction in mttd (detection time) & mttr (resolution time)
Build monitors and alerts designed to manage SLAs, optimize performance, and minimize outages
Construct E2E customer journey dashboards and alerts for customized transactions and applications
Automates reliability requirements into system and application implementations and updates; including the implementation of self-healing solutions (ansible, terraform, etc).
Work with product management team to contribute to -
i) The identification of reliability features & requirements
ii) Level of effort estimates
for more details, contact us at
kalpana.singh@coforgetech.com
   Your application has been submitted successfully
Thanks for submitting the application, Please check your email and Goodluck!
You have already been applied on this job.
Email Me Job
Delete Office
Are you sure, you wish to delete?
Job Application
Are you sure, you wish to delete?
Job Portal
Quickest way to apply and increase your chances of getting shortlisted! Please make sure your profile is up to date before your apply.
Apply for - Upload/Choose documents
Resume
Upload your Resume
Cover letter
You Can Upload Image Files (.png, .jpeg or .gif), .pdf and .docx Files.
You Need To Provide At Least Your Resume(.pdf,.docx and .doc) To Submit An Application.
Apply for - Create an Account
Already have an account?SignIn
New to careerz360.com?SignUp