ANKIT N PATEL

Site Reliability Engineer & Splunk Specialist

6+ Years Experience
15+ Certifications
2000+ Servers Managed
Download Resume


Ankit N Patel

Ankit N Patel

Site Reliability Engineer & Splunk Specialist

I am a person who is positive about every aspect of life. There are many things I like to do, to see, and to experience.

Technical Skills

Operating Systems

Windows Server RHEL Ubuntu

Monitoring & Observability

Splunk Core/ITSI/ES Prometheus Grafana Zabbix Icinga

Splunk Expertise

Deployment & Configuration SPL Query Development ITSI Glass Tables KPI Monitoring Log Onboarding Performance Testing

DevOps & GitOps

CI/CD (GitLab, Jenkins) GitOps (ArgoCD, Helm) Kubernetes/GKE Release Management

Cloud & IaC

GCP AWS Oracle Cloud Azure Terraform

Automation & Scripting

Bash Scripting Python PHP Terraform

Professional Summary

Results-driven Site Reliability Engineer with 6 Years, 7 Months and 28 Days of hands-on experience in Splunk Administration, DevOps, and Cloud Infrastructure Management. Skilled in designing, implementing, and maintaining high-availability, scalable systems across GCP, AWS, and Oracle Cloud environments.

Expertise in Kubernetes, ArgoCD, GitOps workflows, CI/CD pipelines, and Infrastructure as Code (Terraform). Strong proficiency in monitoring and observability using Splunk ITSI, Prometheus, and Grafana, coupled with automation skills in Bash and Python.

Proven track record of enhancing system reliability, performance, and operational efficiency through SRE best practices and cross-functional collaboration.

Key Performance & Impact Highlights

🚀
Client Impact

Implemented enterprise-wide SRE practices for HSBC, Specsavers, and Kindred Group across EMEA, Asia, US, and Australia, delivering 50+ end-to-end Splunk use cases that improved incident detection and reduced MITR by 30–40%.

📈
Reliability & Observability Impact

Defined SLIs, SLOs, and error budgets for critical services, improving system uptime to 99.95%. Implemented advanced monitoring frameworks using Splunk ITSI, Prometheus, and Grafana to ensure proactive detection of performance and availability issues.

⚙️
Automation & DevOps Impact

Developed 50+ Bash/Python/PHP/Terraform scripts for operational automation, CI/CD pipelines, and GitOps workflows using ArgoCD, Helm, and Kubernetes. Streamlined cloud deployments across GCP, reducing manual intervention and deployment times by 25–30%.

🖥️
Infrastructure & Technical Impact

Managed ~2000+ servers, including patching, high-availability configurations, LVM, DNS, and SSL certificate management. Deployed and optimized 30+ Splunk instances supporting 50+ critical systems, improving observability and operational efficiency.

👥
Team Leadership & Knowledge Impact

Mentored 10+ junior engineers, conducted SME sessions, and delivered hands-on workshops on SRE best practices, monitoring, and automation, reducing onboarding time by 25% and fostering a culture of reliability and proactive incident management.

Work Experience

Site Reliability Engineer - Splunk

Miratech India Pvt Ltd, Bengaluru, India
June 2024 - Present
Client: HSBC Bank | Region: EMEA, Asia

Overview: HSBC is a leading global banking and financial services organisation, headquartered in London, operating across 60+ countries, serving personal, corporate, and investment clients with a focus on customer-centricity, global connectivity, and sustainable growth.

  • Splunk & Monitoring Expertise: Extensive experience with Splunk Core and ITSI, including deployment, configuration, optimization, and dashboard development using complex SPL queries
  • Site Reliability Engineering & Observability: Introduced SRE practices, defining SLIs/SLOs, establishing KPIs, implementing error-budget tracking, and improving system resilience
  • DevOps & GitOps Practices: Designed and implemented GitOps workflows using GitLab, Helm, GKE, and ArgoCD with hands-on experience in CI/CD pipelines and Infrastructure as Code (Terraform)
  • Automation & Scripting: Developed Bash scripts to automate operational tasks and reduce manual intervention; experience with Python for microservices automation
  • Cloud & Microservices: Experienced with GCP, microservices architecture, and monitoring using Prometheus and Grafana for scaling and high availability
  • Team Leadership & SME Activities: Mentored junior team members, conducted knowledge-sharing sessions, and acted as SME for Splunk applications
  • Use Case Management & Solution Design: Designed and delivered end-to-end Splunk use cases including alerting, dashboards, and service health monitoring
  • Collaboration & Stakeholder Engagement: Partnered with stakeholders to collect requirements and deliver technical sessions, demos, and training

Splunk Application Developer SRE

Accenture, Gujarat-Pune, India
March 2022 - June 2024
Client: Specsavers | Region: EMEA

Overview: Specsavers is a leading multinational optical and audiology retailer headquartered in the UK, with over 2,200 stores and 40,000 employees across Europe, the Middle East, and Africa (EMEA). The company provides eye care, hearing services, and eyewear solutions.

  • Developed Splunk applications and knowledge objects tailored to business requirements, enhancing monitoring and alerting capabilities
  • Hands-on experience with Splunk Cloud and Splunk Enterprise Security (ES) for large-scale, multi-tenant deployments
  • Collaborated with stakeholders during Agile sprints, translating functional requirements into actionable monitoring deliverables
  • Created and fine-tuned complex SPL-based alerts, integrating with ServiceNow, Jira, Microsoft Teams, and email
  • Designed Glass Tables, KPIs, and service health monitoring using Splunk ITSI, boosting observability and real-time insights
  • Implemented database monitoring with DB Connect App, and built dynamic dashboards using XML, SPL, and Grafana
  • Led data onboarding from Linux and Windows platforms, and integrated network device monitoring through Splunk and Zabbix
  • Managed CI/CD and deployment pipelines across GCP, AWS, and Oracle Cloud, ensuring consistent, zero-downtime rollouts
  • Implemented Infrastructure as Code (Terraform) for automated provisioning and configuration
  • Applied GitOps practices using GitLab, Helm, and ArgoCD to enhance deployment consistency
  • Advocated SRE principles—defined SLIs/SLOs, error budgets, and performance KPIs to enhance system reliability

System Administration & Site Reliability Engineer [SRE]

Tata Consultancy Services Limited, Gujarat, India
June 2019 - March 2022
Client: Kindred Group – Unibet | Region: EMEA, US, Australia

Overview: Kindred Group plc is a leading online gambling operator headquartered in Malta, offering multiple brands (including Unibet and 32Red) across regulated markets in Europe and Australia. The company provides sports betting, casino & games, poker, and bingo via a proprietary, scalable multi-brand platform.

  • Engineered and supported Linux infrastructure and configuration management systems, ensuring enterprise-grade performance, security, and high availability
  • Managed centralized Linux repositories for automated patching, OS updates, and software deployments across client environments
  • Built and configured virtual (RHEVM) and physical servers (Supermicro, Fatwin, SuperMicroCloud) with RAID setups
  • Administered and maintained Splunk Enterprise — configured indexers, search heads, user roles, storage, and onboarded logs
  • Conducted proactive Splunk health checks using the Monitoring Console and integrated alerting with Grafana and Icinga
  • Managed DNS, SSL certificates (via Trustwave), and Linux user/group administration for secure access management
  • Implemented system backups and recovery plans using Commvault, enhancing data protection and disaster-recovery readiness
  • Participated in DevOps workflows, using Jira, Confluence, Bitbucket, Git, and Jenkins for version control and CI/CD automation
  • Assisted in early automation initiatives using Bash scripting to streamline maintenance and reporting tasks
  • Delivered on-call production support and rotational coverage, maintaining SLA adherence and high uptime

Education & Certifications

Academic Qualification

Bachelor of Computer Applications (BCA) – Development & Networks

Gujarat University, Ahmedabad – 2016 to 2019 (Graduated with Distinction)

Completed 14 years of formal education (Primary to Graduate)

Training & Certification

Google Cloud Certified – Associate Cloud Engineer

Google

Oracle Cloud Infrastructure 2023 Certified Foundations Associate

Oracle

Microsoft Certified – Azure AI Fundamentals (AI-900)

Microsoft

DevOps Training

Nisarg Software Ltd, Gujarat, India

Udemy Certifications (11+ IT Certifications)

DevOps CI/CD, RHCSA, Splunk Administration, Agile Project Management, Computer Forensics

LinkedIn Learning: PHP Essential Training

LinkedIn

Download Resume