Envestnet | Yodlee

Site Reliability Engineer-L1

Envestnet | Yodlee
200000 - 450000 P.A.
0-1 Years Full Time
Bangalore, Karnataka, IN

Vacancy: Not Disclosed Posted: 2 years ago Applicants: 0
Share via

Job Description

Brief Description:

Seeking a full-time Engineer to join SRE team in the Bengaluru, India office. For this role, we are looking for a self-motivated talented individual who can demonstrate in-depth technical expertise necessary to monitor the alerts, resolve issues on a day to day basis. A successful candidate must have the ability to work independently, communicate effectively, and successfully manage a diverse set of responsibilities in troubleshooting and guiding the team technically on problem solving.

Roles and Responsibilities:

  • 0.6 to 2 years of experience in application production support environment with ability to solve complex problems and SRE role.
  • Responsible for reliability of our end-to-end data infrastructure.
  • Collaborate with other teams members like Observability, CIO teams so that we can design our systems for better monitoring. That way, we can catch Incidents before customers report.
  • Tackle issues across the entire stack - hardware, software, application and network.
  • Analyze and troubleshoot application issues in a timely fashion and help incident support team as necessary to restore services and prevent from happening again.
  • Assist in maintenance and upgrades of existing software applications.
  • involve in projects, prioritizing and executing assigned tasks/projects within deadlines.
  • Assist in risk assessment and mitigation activities.
  • Ready to work in night and weekend on demand basis.
  • Ability to multi-task and manage multiple projects/tasks effectively within deadlines.
  • Automate existing manual tasks so that we gain order of magnitude efficiency and effectiveness gains by building tools/services with the aim on self-serve and auto-heal.
  • Accountable and technical owner for ensuring SRE readiness for new modules that need to be supported from various angles like monitoring, adequate technical onboarding trainings, preparedness to handle incidents and continuous optimizations of existing modules.

Skills:

  • Adept on Linux platform.
  • Experience with Docker, Kubernetes, AWS services like S3, SQS, Lambda, EC2, EKS, etc and expert understanding of best practices.
  • Knowledge in debugging like java and node applications Memory utilization, Analysis of thread dumps, heap dumps
  • Exposure on Networking, load balancers, Messaging Queue and database fundamentals (preferably PostgreSQL and knowledge of Oracle and NoSQL DBs like Mongo DB).
  • Knowledge on working in programming languages like python, shell, Perl, java, etc.
  • Knowledge in handling issues across the entire stack - hardware, software, application and network.
  • Knowledge on JBoss, Tomcat, Spring boot, etc
  • Knowledge in monitoring tools like Splunk, New Relic, Sensu, Foglight, etc.
  • Knowledge of Incident, problem, Change management
  • Knowledge in APM tools like New Relic
  • Knowledge on IBM MQ/Kafka/Redis/Elastic Search.
  • Take up current monitoring two notches higher and ensure operations team to be able to detect all critical issues before customer.
  • Have a systematic problem-solving approach, coupled with strong communication and analytical skills and a sense of ownership, initiative, grit, and drive.
  • Exposure to CI/CD, GitLab, JIRA, Service Now
  • Exposure to UI technologies
  • Good to have certifications like ITIL, AWS

Skills Required: Linux Troubleshooting,Application Support


JOBS BY CATEGORY