Aviso AI

Published on 06/03/2026
Ludhiana (041)
To be defined

Description:

Job Title: DevOps Manager

Location: Remote

Employment Type: Full-time

Role Type: Individual Contributor (IC)

About Aviso AI: Aviso is an AI-driven Revenue Intelligence platform that helps Sales and GTM teams close more deals, accelerate growth, and make smarter decisions. With offices in the US and India, we partner with global leaders like Dell, Honeywell, MongoDB, Splunk, and RingCentral. Backed by top Silicon Valley investors, Aviso combines data science and predictive insights to help enterprises optimize performance, exceed revenue goals, and solve complex sales challenges.

Responsibilities:

Lead the design, implementation, and management of infrastructure-as-code using Pulumi and AWS CDK, ensuring scalable and reliable provisioning of AWS resources (ECS, EC2, VPC/networking, Lambda, RDS/Postgres, self-managed MongoDB, Kubernetes clusters (EKS)).
Drive the development of Python-based tools and automation frameworks to streamline provisioning, deployments, and operational workflows (backups, migrations, and system maintenance).
Own and evolve CI/CD strategy using GitHub Actions, including build pipelines, automated testing, security validations, and environment promotion workflows.
Oversee the operation, performance, and security of Linux-based systems (EC2, container hosts, and Kubernetes worker nodes), ensuring best practices in OS tuning, logging, and monitoring.
Partner with engineering teams to architect and deliver scalable, secure services across ECS (Fargate/EC2), EC2, Lambda, and Kubernetes-based microservices platforms, with a strong focus on networking (VPCs, subnets, security groups, load balancers).
Provide leadership in database operations (Postgres and MongoDB), including provisioning, configuration, performance optimization, backup strategies, and high-availability architecture.
Design, deploy, and manage Kafka-based streaming platforms, including cluster setup, topic management, scaling, and ensuring high availability and fault tolerance.
Champion observability practices by implementing robust monitoring, logging, and alerting systems; lead incident response and drive postmortem analysis for continuous improvement.
Mentor and guide DevOps/SRE team members, fostering best practices in automation, reliability, and infrastructure management.

Requirements:

12+ years of experience in DevOps, SRE, Platform Engineering, or backend systems with demonstrated leadership or team management exposure.
Strong proficiency in Python for automation, scripting, and building internal tools.
Deep hands-on experience with AWS services (ECS, EC2, Lambda) and strong understanding of networking fundamentals (VPC, subnets, security groups, ALB/NLB).
Proven experience with infrastructure-as-code tools, specifically Pulumi (Python) and AWS CDK, in production environments.
Strong Linux fundamentals, including systemd, networking, and advanced troubleshooting in production systems.
Expertise in building and managing CI/CD pipelines using GitHub Actions.
Solid understanding of Postgres and MongoDB operations in cloud environments.
Hands-on experience with Kubernetes (cluster management, deployments, Helm, autoscaling) and Kafka (cluster operations, performance tuning, and stream processing concepts).

The processing of personal data received will be carried out in accordance with applicable laws, including the UK General Data Protection Regulation (UK GDPR) and the Data Protection Act 2018.