JobVacancyResult - Job Vacancy in Site Reliability Engineer-L1. Experience: 0-1 Year. Location: Bangalore, Karnataka, India

Job Description

Brief Description:

Seeking a full-time Engineer to join SRE team in the Bengaluru, India office. For this role, we are looking for a self-motivated talented individual who can demonstrate in-depth technical expertise necessary to monitor the alerts, resolve issues on a day to day basis. A successful candidate must have the ability to work independently, communicate effectively, and successfully manage a diverse set of responsibilities in troubleshooting and guiding the team technically on problem solving.

Roles and Responsibilities:

0.6 to 2 years of experience in application production support environment with ability to solve complex problems and SRE role.
Responsible for reliability of our end-to-end data infrastructure.
Collaborate with other teams members like Observability, CIO teams so that we can design our systems for better monitoring. That way, we can catch Incidents before customers report.
Tackle issues across the entire stack - hardware, software, application and network.
Analyze and troubleshoot application issues in a timely fashion and help incident support team as necessary to restore services and prevent from happening again.
Assist in maintenance and upgrades of existing software applications.
involve in projects, prioritizing and executing assigned tasks/projects within deadlines.
Assist in risk assessment and mitigation activities.
Ready to work in night and weekend on demand basis.
Ability to multi-task and manage multiple projects/tasks effectively within deadlines.
Automate existing manual tasks so that we gain order of magnitude efficiency and effectiveness gains by building tools/services with the aim on self-serve and auto-heal.
Accountable and technical owner for ensuring SRE readiness for new modules that need to be supported from various angles like monitoring, adequate technical onboarding trainings, preparedness to handle incidents and continuous optimizations of existing modules.

Skills:

Adept on Linux platform.
Experience with Docker, Kubernetes, AWS services like S3, SQS, Lambda, EC2, EKS, etc and expert understanding of best practices.
Knowledge in debugging like java and node applications Memory utilization, Analysis of thread dumps, heap dumps
Exposure on Networking, load balancers, Messaging Queue and database fundamentals (preferably PostgreSQL and knowledge of Oracle and NoSQL DBs like Mongo DB).
Knowledge on working in programming languages like python, shell, Perl, java, etc.
Knowledge in handling issues across the entire stack - hardware, software, application and network.
Knowledge on JBoss, Tomcat, Spring boot, etc
Knowledge in monitoring tools like Splunk, New Relic, Sensu, Foglight, etc.
Knowledge of Incident, problem, Change management
Knowledge in APM tools like New Relic
Knowledge on IBM MQ/Kafka/Redis/Elastic Search.
Take up current monitoring two notches higher and ensure operations team to be able to detect all critical issues before customer.
Have a systematic problem-solving approach, coupled with strong communication and analytical skills and a sense of ownership, initiative, grit, and drive.
Exposure to CI/CD, GitLab, JIRA, Service Now
Exposure to UI technologies
Good to have certifications like ITIL, AWS

Skills Required: Linux Troubleshooting,Application Support

Site Reliability Engineer-L1

Share via

Job Description

JOBS BY CATEGORY

Location

IT Jobs

Non IT Jobs

Roles

Other Jobs

JOBS BY CATEGORY

Delhi

Mumbai

Bengaluru

Kolkata

Chennai

More Jobs

Web Developer

Marketing

Designer

SEO

Coder

More Jobs

Accountant

Call center

Hotel

Sales

Content writing

More Jobs

Air hostess

Data Analyst

Business Analyst

Networking

Accountant

More Jobs

Walkins

Fresher

Freelance

Part time

Contract

More Jobs

ABOUT US