What You’ll Do
As a Senior DevOps & Solutions Architect, you will be a key leader in designing, building, and scaling the core infrastructure that powers our agentic environment. This isn’t just about maintaining systems; it’s about architecting the future of our enterprise-grade AI solutions. You will be a central partner to product teams and client-facing squads, providing technical leadership and strategic direction to ensure our platform is robust, scalable, and secure.
InteractiveAI runs on two high-performance engines: Product Teams that craft and scale our Agentic IDE, and Implementation Squads that ship high-impact, domain-specific AI solutions. Depending on your craft and ambition, you’ll join the team where you can create outsized value—and you’ll have a transparent, performance-based path to growth and rewards.
- Architect and scale multi-tenant, cloud-agnostic runtimes (Kubernetes/GPU clusters) supporting on-prem, VPC, and hybrid installations.
- Design and implement secure, end-to-end CI/CD pipelines for automating complex ML workflows, from data ingestion and fine-tuning (LoRA/QLoRA) to secure, high-stakes deployments.
- Provide solutions architecture expertise by partnering with product and client performance squads to accelerate the journey of custom agents from sandbox (≤ 5 days) to production (4–6 weeks), meeting tight SLAs.
- Lead the adoption of infrastructure-as-code best practices using tools like Terraform, Ansible, or similar.
- Define and manage the strategy for our containerized workloads (Docker, Kubernetes, etc.) to optimize for performance, cost, and reliability.
- Establish and enforce security, compliance, and data governance standards, particularly for enterprise clients.
- Mentor junior engineers and provide strategic guidance on infrastructure design, incident response, and system reliability.
What We’re Looking For
We’re seeking a seasoned architect who can lead the design and implementation of a robust, scalable infrastructure for our agentic platform and its ecosystem of solutions. You should have a proven track record of architectural leadership, strong fundamentals, and a deep understanding of operational maturity.
Minimum Requirements:
- 5+ years of experience in DevOps, Site Reliability, or Infrastructure Engineering roles, with at least 2 years in a solutions or systems architect capacity.
- Proven experience deploying and managing complex AI/ML production workloads on at least one major public cloud (e.g., AWS, GCP, or Azure).
- Extensive experience designing, deploying, and managing robust, resilient, and distributed cloud solutions at scale.
- Deep expertise in containerization and orchestration (Docker, Kubernetes).
- Strong track record of building and managing advanced CI/CD pipelines for complex software and ML lifecycles.
- Expert-level proficiency with infrastructure-as-code tools (Terraform, CloudFormation, or Pulumi).
- Strong scripting and automation skills (Python, Bash, or similar).
- Extensive experience with monitoring and logging stacks (e.g., Prometheus, Grafana, ELK).
- Exceptional communication and collaboration skills with a proven ability to lead and influence cross-functional teams.
Additional Requirements:
- Experience with ML/AI-specific infrastructure and MLOps tooling (e.g., MLflow, Weights & Biases).
- Demonstrated experience implementing security practices and compliance frameworks (e.g., GDPR, ISO 27001) in highly regulated environments.
- Previous work in enterprise-grade or highly regulated industries is a significant plus.
Interview Process
We keep our process focused and respectful of your time. Most candidates complete it in 2–3 weeks. Here’s what to expect:
- Intro Call – 30 minutes with our team to align on fit and expectations.
- Take-Home Challenge – A practical, real-world architecture design task.
- Technical Interview – Deep dive into the challenge, technical expertise, and architectural philosophy.
- Cultural and Values Interview – Discussion on motivation, cultural, and value alignment.
- Offer – Final conversation and offer.
We’re building a team of builders, people who care about impact, quality, and growth. If that’s you, let’s talk
careers@interactive.ai
About us
InteractiveAI is a fast-growing startup on a mission to empower enterprises with fully managed AI agent lifecycles.
We are building the next generation of enterprise-AI solutions, delivering an end-to-end Agentic IDE alongside an extensible ecosystem of agentic resources and solutions.
Our platform allows companies to orchestrate, monitor, evaluate, deploy and improve AI agents—and soon fine-tune and own their own models.
We value autonomy, speed, and innovation, and we’re building a world-class team to match. Our squads are lean, focused, and execution-driven.
If you thrive in high-performance environments and want to be part of a company that rewards transformational outcomes, this is for you.