About Role: Very good knowledge of Log analysing & monitoring tools like Prometheus, Loki, Dynatrace, Grafana & SolarWinds. Understanding of Infrastructure environments Cloud, VMware, storage, networks, databases, etc.
What’ll be Responsible for?
-
Design and development of security policies, standards, and procedures in accordance with organization goals.
-
Responsible for proactive monitoring of alerts (Network, Infra, Applications) and taking corrective actions.
-
Responsible for Incident Management life cycle & Service requests fulfilment
-
Responsible for Incident logging, accurately tracks and documents all incidents.
-
Adherence to the process compliance
-
Adherence to the SLAs defined for the platform, Service uptime.
-
Coordination with cross-group peers both proactively and reactively produces quality documentation and share with the appropriate team members.
-
Responsible to develop SOP documents.
-
Ability to deep dive into identifying the root cause of various service-impacting events and optimizing.
-
Act as a First Point of Contact for incidents, escalations, and business-impacting technology issues
-
To ensure the maximum possible service availability and performance of the platforms
-
Responsible for continuous improvement of the process science.
What you’d have?
-
Experience of 4-6 years in NOC
-
Experience in Alert/Incident Management and a good understanding of SLAs
-
Troubleshooting, Problem-solving & Strong presentation skills
-
Analytical and communication skills
-
Strong knowledge of Linux, Network & database querying
-
Knowledge of asset management
-
Very good knowledge of Log analysing & monitoring tools like Prometheus, Loki, Dynatrace, Grafana & SolarWinds
-
Understanding of Infrastructure environments Cloud, VMware, storage, networks, databases, etc.
-
Strong Linux, Networking, Log analysing, and database querying skills.
-
Must have experience with monitoring tools like Prometheus, Loki, Grafana, and Dynatrace & building monitoring dashboards.
-
Experience in alerts mitigation & optimization - Knowledge of the ITIL framework
-
Hands-on exp with observability tools will be an added advantage.
-
Must have expertise in maintaining/updating asset management.
-
Certifications: ITIL foundation, AZ-900, Shell Scripting, Python, Hardware & networking.
Why Join us?
We thought you'd never ask! We offer all the usual stuff: competitive salary, flexible working hours, challenging product culture, But the real perks are:
-
Challenging and fun work environment solving meaningful real-life business problems - you will never have a boring day at the office.
-
World-class team who loves solving tough problems and have a bias for action.
-
Tanla is an equal opportunity employer. We welcome and encourage diversity in the workplace regardless of race, gender, religion, age, sexual orientation, gender identity, disability, or veteran status.