💼
WORK EXPERIENCE
Site Reliability Engineer (SRE)
Innfinit | Nov. 2022 - Present
Site Reliability Engineer with 2+ years in observability and monitoring of critical systems for Fortune 500 enterprises. Specialized in golden signals monitoring, incident response, and maintaining high availability for mission-critical applications.
PROJECT: CN/Delta Monitoring (Leadership) - Sovos Enterprise
  • Led observability architecture implementation for global tax compliance infrastructure
  • Configured Kubernetes monitoring using Prometheus Operator and kube-state-metrics (test/lab)
  • Developed Grafana Cloud dashboards for: Kubernetes, RabbitMQ, Redis, AWS RDS, S3
  • Implemented an optimized alerting system with detailed runbooks to minimize false positives
  • Managed the project with a reduced team, delivering quality results on time
PROJECT: SRE Analysis & Correlation - Liberty Mutual US (Fortune 100)
  • Conducting root cause analysis and post-mortem investigations using BigPanda platform
  • Contributing to golden signals monitoring (latency, traffic, errors, saturation) for financial services
  • Supporting SLI/SLO framework development and service health thresholds under senior guidance
  • Analyzing Splunk monitoring and supporting Datadog migration for enhanced reliability visibility
  • Optimizing correlation logic and global standards to improve noise reduction and detection accuracy
  • Creating standardized documentation in Confluence to expedite resolution and improve MTTR
PROJECT: Comprehensive Observability Platform - Healthcare Systems
  • Implemented high-availability architecture with Grafana and Prometheus for production monitoring
  • Configured synthetic monitors to validate availability and user experience in critical applications
  • Implemented centralized logging with Loki for multiple microservices
  • Integrated notification channels (Slack, Teams, Email, PagerDuty) for incident response
  • Built modular infrastructure optimizing resources while maintaining capacity for growth
Technical Support
Recomin SM | Mar. 2019 - Nov. 2022
Comprehensive technical support for enterprise mining operations, building foundational troubleshooting and infrastructure management skills.
Los Bronces Integrated Project - Technical Support
  • Provided comprehensive technical support across administrative, HR (Nubox), risk prevention, machinery, procurement and operations departments
  • Troubleshot complex technical issues with enterprise computers, network printers, and network infrastructure
  • Configured Windows OS environments and Microsoft Office suite for operational efficiency
  • Managed router configurations and network connectivity for critical mining operations
  • Collaborated with cross-functional teams to maintain operational continuity in an enterprise environment
📧
CONTACT
📱 +56 9 6414 2352
📧 fabianignaciomv@gmail.com
🔗 linkedin.com/in/fabianimv
🏆 credly.com/users/fabianimv
🛠️
TECHNICAL SKILLS
CLOUD PLATFORMS
AWS: EC2, RDS, S3, VPC, Lambda
Google Cloud Platform: HA Grafana platform implemented and integrated with GCE, GCS, and Google Cloud SQL
Containers: Kubernetes and Docker (lab/test + production monitoring)
OBSERVABILITY
Datadog APM, Grafana Cloud, Prometheus, AWS CloudWatch, Grafana Exporters, BigPanda, Splunk
AUTOMATION
Terraform (IaC), Jenkins Pipeline, Git/GitHub Actions
TOOLS
Confluence, Jira, ServiceNow, SNMP Exporter, PagerDuty
LANGUAGES & FRAMEWORKS
Python, Bash, YAML/HCL, SQL, Node.js
METHODOLOGIES
SRE (Golden Signals, SLI/SLO), DevOps, Scrum, Incident Analysis, Post-Mortem Investigations
🏆
CERTIFICATIONS
AWS Cloud Practitioner (2025)
Microsoft Azure AI Fundamentals (2024)
OCI Foundations Associate (2023)
Scrum Foundation Professional
🎓
EDUCATION
Computer Engineering
Instituto Profesional Duoc UC
2022-2026 | Currently in 4th (final) year
Computer Programming Analyst
Instituto Profesional Duoc UC
2022-2024 | Final project: "True Q" Exchange Application | Grade: 60/70
🌐
LANGUAGES
Spanish: Native
English: B2 Level (Upper Intermediate)