Hi, I'm Rohit Verma

DevOps/SRE Lead • UPI @ Paytm • Cloud & Platform Engineering

DevOps/SRE leader with ~14 years of experience building reliable, cost-efficient platforms, with a focus on high availability, automation, and microservices. Deep FinTech exposure on UPI at Paytm, where I own Kubernetes/EKS, GitOps (Argo CD/Rollouts), service mesh and proxy layers (Istio, Nginx, HAProxy), autoscaling (KEDA), Kafka/Redis, the ELK Stack, and observability (Prometheus/Grafana/Thanos). I make data-driven architectural decisions, drive migrations, implement Infrastructure as Code (Terraform/Ansible/SaltStack), harden security, and ship experiments safely. Strong communicator with a proven ability to collaborate across engineering, security, and product teams. I also build for fun: bitresearch.ai and devxops.tech.

Years of Experience: 14+
Automation: 10+ years
Cloud Platforms: AWS
Certifications: 4+

Career Highlights

Led the UPI migration from a physical DC (PPBL - Paytm Payments Bank) to AWS Cloud, covering architecture, networking, data, cutover, and post-migration optimizations, with a High Availability (HA) design.

Progressive delivery with Argo Rollouts (canary/experiments) plus KEDA autoscaling to absorb traffic bursts in a microservices architecture.

Security & compliance: IRSA, least-privilege IAM, TLS, CIS baselines, CSPM (Cloud Security Posture Management); strengthened security posture in support of PCI-DSS, ISO 27001, and SOC 2 compliance.

Automation & IaC: Implemented comprehensive Infrastructure as Code with Terraform, Ansible, and SaltStack; automated 90% of infrastructure provisioning.

Cost & reliability: right-sizing, ALB/NLB logging strategy, S3 lifecycles; MTTR/SLA dashboards in Grafana; data-driven decisions for capacity planning.

Hands-on with Istio, Kafka, Redis, Aerospike, Terraform, Helm, Jenkins, Nginx, HAProxy, ELK Stack, Thanos.

Handled multiple production incidents end to end: quick resolution, root cause analysis, and preventive measures.

Strong cross-functional collaboration with engineering, security, compliance, and product teams to deliver business-critical solutions.

[Charts: Skills Distribution; Skills by Category]

Career Highlights

Daily Transactions: 140M+ (high-scale UPI platform)
Latency Reduction: 30% (performance optimization)
Cost Savings: 25% (infrastructure efficiency)
MTTR Improvement: 40% (reliability & observability)
Years Experience: 14+
Microservices: 50+
Global Companies: 5

In-Demand Skills

Kubernetes/EKS
Argo CD
Argo Rollouts (Canary/Experiments)
Istio Service Mesh
AWS (EC2, VPC, NLB/ALB, S3, ECR, IAM/IRSA, CloudWatch...)
Terraform
Helm
Kafka
Aerospike
Redis
Elasticsearch
Prometheus/Grafana
ELK Stack
Thanos
Alertmanager
GitOps
SRE/Resilience
Jenkins
Ansible
SaltStack
KEDA Autoscaling
DR/Failover
Cost Optimization
Python
Golang (K8s controllers)
Nginx
HAProxy
High Availability (HA)
Data-Driven Decisions
Automation
Infrastructure as Code (IaC)
Microservices

Ask Me About...

How did you migrate UPI from DC to AWS without downtime?

What are you exploring now?

Favorite tool combo?

Let's Build Something Amazing

Looking for a DevOps/SRE leader who can architect cloud platforms, drive migrations, and build resilient systems? Let's talk.