Job Brief
We are seeking a highly experienced Senior DevOps Engineer with deep, hands-on expertise in managing large-scale applications within Kubernetes environments. This role is pivotal in driving our infrastructure's performance, reliability, and cost-efficiency.
VentureDive Overview
Founded in 2012 by veteran technology entrepreneurs from MIT and Stanford, VentureDive is the fastest-growing technology company in the region that develops and invests in products and solutions that simplify and improve the lives of people worldwide. We aspire to create a technology organization and an entrepreneurial ecosystem in the region that is recognized as second to none in the world.
Key Responsibilities:
- Debug, maintain, and deploy applications to Kubernetes clusters via Flux CD.
- Build, maintain, and optimize observability, monitoring, and alerting infrastructure, leveraging tools such as Grafana, Prometheus, Loki, and Tempo.
- Collaborate with development teams to create CI/CD pipelines for automated deployments.
- Develop and implement fault-tolerant infrastructure testing processes.
- Perform root cause analysis on system failures and service interruptions to ensure high availability and reliability.
- Optimize infrastructure for performance, reliability, and cost.
- Stay up-to-date with new tools and technologies to continuously improve our DevOps practices.
Required Qualifications & Technical Skills:
- Proven experience deploying and managing applications in Kubernetes.
- Strong knowledge of observability, monitoring, and alerting tools (Grafana, Prometheus, Loki, Tempo).
- Solid understanding of Linux operating systems and cloud platforms (AWS, GCP, or Azure).
- Proficiency with CI/CD pipelines (GitHub or similar).
- Hands-on experience in debugging and root cause analysis for large-scale applications.
- Familiarity with infrastructure-as-code tools (e.g., Terraform, Ansible, Helm).
- Scripting and automation skills (e.g., Bash, Python, or similar).
- Kubernetes certifications are good to have.
Bonus Skills:
- Experience in performance tuning and load testing.
- Familiarity with security best practices in a cloud environment.
- Knowledge of OpenTelemetry and tracing technologies.
- Experience with fault-injection testing and chaos engineering.
- Contributions to open-source DevOps tools or platforms.
What we look for beyond required skills
In order to thrive at VentureDive, you
…are intellectually smart and curious
…have a passion for, and take pride in, your work
…deeply believe in VentureDive’s mission, vision, and values
…have a no-frills attitude
…are a collaborative team player
…are ethical and honest
Are you ready to put your ideas into products and solutions that will be used by millions?
You will find VentureDive to be a fast-paced, high-standards, fun, and rewarding place to work. Not only will your work reach millions of users worldwide, you will also be rewarded with a competitive salary and benefits. If you think you have what it takes to be a VenDian, come join us ... we're having a ball!
#LI-Hybrid