NOC ENGINEER-INFRA
Location
Hyderabad, Telangana IN
Department
Product & Engineering
Job Role
Very good knowledge of Log analysing & monitoring tools like Prometheus, Loki, Dynatrace, Grafana & SolarWinds. Understanding of Infrastructure environments Cloud, VMware, storage, networks, databases, etc.
What you'll be responsible for?
- Design and development of security policies, standards, and procedures in accordance with organization goals.
- Responsible for proactive monitoring of alerts (Network, Infra, Applications) and taking corrective actions.
- Responsible for Incident Management life cycle & Service requests fulfilment
- Responsible for Incident logging, accurately tracks and documents all incidents.
- Adherence to the process compliance
- Adherence to the SLAs defined for the platform, Service uptime.
- Coordination with cross-group peers both proactively and reactively produces quality documentation and share with the appropriate team members.
- Responsible to develop SOP documents.
- Ability to deep dive into identifying the root cause of various service-impacting events and optimizing.
- Act as a First Point of Contact for incidents, escalations, and business-impacting technology issues
- To ensure the maximum possible service availability and performance of the platforms
- Responsible for continuous improvement of the process science.
Qualification and other skills
- Experience of 4-6 years in NOC
- Experience in Alert/Incident Management and a good understanding of SLAs
- Troubleshooting, Problem-solving & Strong presentation skills
- Analytical and communication skills
What you'd have?
- Strong knowledge of Linux, Network & database querying
- Knowledge of asset management
- Very good knowledge of Log analysing & monitoring tools like Prometheus, Loki, Dynatrace, Grafana & SolarWinds
- Understanding of Infrastructure environments Cloud, VMware, storage, networks, databases, etc.
- Strong Linux, Networking, Log analysing, and database querying skills.
- Must have experience with monitoring tools like Prometheus, Loki, Grafana, and Dynatrace & building monitoring dashboards.
- Experience in alerts mitigation & optimization - Knowledge of the ITIL framework
- Hands-on exp with observability tools will be an added advantage.
- Must have expertise in maintaining/updating asset management.
- Certifications: ITIL foundation, AZ-900, Shell Scripting, Python, Hardware & networking.
Why join us?
We thought you'd never ask! We offer all the usual stuff: competitive salary, flexible working hours, challenging product culture, But the real perks are:
- Challenging and fun work environment solving meaningful real-life business problems - you will never have a boring day at the office.
- World-class team who loves solving tough problems and have a bias for action
We welcome and encourage diversity in the workplace regardless of race, gender, religion, age, sexual orientation, gender identity, disability, or veteran status.