MLOps Engineer

Location

Hyderabad, Telangana IN

Department

Product & Engineering

Job Role

‍

We are looking for an MLOps Engineer with strong experience in deploying, scaling, and monitoring machine learning models in production environments. This role focuses on building reliable ML pipelines, optimizing model inference performance, and ensuring end-to-end lifecycle management of ML systems.

What you'll be responsible for?

‍

Build, deploy, and scale ML models using Docker and Kubernetes (EKS/AKS).

Develop and manage CI/CD pipelines using GitHub Actions / Jenkins.

Design and maintain low-latency, high-throughput REST/gRPC services for ML model serving.

Implement model serving and inference using vLLM, Triton Inference Server, or TGI.

Optimize inference performance by tuning batch sizes, managing GPU memory, and improving throughput.

Implement logging of prompt/response pairs, monitor model performance, and detect model drift.

Set up monitoring and observability using LangFuse, Helicone, or similar tools.

Build and manage ML pipelines and experiment tracking using MLflow or Weights & Biases.

Develop automated retraining pipelines and manage model versioning and registry.

Build and maintain data pipelines for training data ingestion, PII handling, and versioning using DVC.

Work with data storage systems like ClickHouse or similar columnar databases.

Qualification and other skills

‍

Familiarity with Hugging Face ecosystem.

Experience with GPU driver and CUDA troubleshooting.

Experience in cost optimization for cloud GPU workloads or on-prem infrastructure.

What you'd have?

‍

4–6 years of experience in MLOps / ML Engineering roles.

Strong programming skills in Python (mandatory) and working knowledge of Bash.

Hands-on experience with Kubernetes, Docker, and CI/CD pipelines.

Experience in building and scaling REST/gRPC APIs and services.

Strong understanding of system design for high-performance, scalable systems.

Experience with Git-based version control workflows.

Hands-on experience with model serving tools (vLLM, Triton, TGI).

Strong understanding of ML lifecycle, model monitoring, and observability.

Experience with experiment tracking tools (MLflow or Weights & Biases).

Basic data engineering knowledge including data pipelines and versioning.

Strong problem-solving skills and ability to work in a fast-paced environment.

Why join us?

Impactful Work: Build and scale ML systems that power intelligent, real-time communication platforms.

Tremendous Growth Opportunities: Work on cutting-edge AI/ML technologies and advance your expertise in MLOps.

Innovative Environment: Be part of a fast-paced, AI-driven ecosystem focused on experimentation and continuous improvement.

‍

Tanla is an equal opportunity employer. We champion diversity and are committed to creating an inclusive environment for all employees.

Connect to join our team

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

Company

Resources

Values

Latest from Tanla

Be the first to know about latest developments and news from Tanla by subscribing to our email alert.

By clicking “Accept”, you agree to the storing of cookies on your device to enhance site navigation, analyze site usage, and assist in our marketing efforts. View our Privacy Policy for more information.

Deny Accept