Open Role // Boost Capital Engineering

AI Software
Engineer

AI Reliability Production Systems Evaluation & Observability Remote Full-time USD 25K–65K / yr

Own the reliability, observability, evaluation, and production performance of the AI systems powering inclusive finance across emerging markets.

Apply now Read the role Send us your LinkedIn + GitHub →
01 / 06

About Boost

At Boost, we build the AI infrastructure powering digital onboarding for banks and microfinance institutions across Southeast Asia. We enable financial institutions to onboard clients digitally for loans and savings in 5–10 minutes — through chat, without app downloads.

Traditionally, banking meant physical branches and long forms. Boost turns that on its head — with chat-based lending, document AI, and agentic workflows that make financial services 100x faster and radically more accessible. We integrate with a new partner bank in 2–3 weeks, helping them expand their reach instantly.

Backed by top-tier institutional and angel investors, Boost is building the digital rails for inclusive finance in emerging markets.

5–10 min
Digital onboarding, via chat
100x
Faster than branch banking
2–3 wks
To integrate a new partner bank
02 / 06

The Role

We're looking for an AI Reliability Engineer to own the reliability, observability, evaluation, and production performance of AI systems — making sure agents and workflows are measurable, reliable, cost-efficient, and production-ready.

This is not a pure ML research role, and it's not a traditional DevOps role. You'll sit between AI Engineering and Infrastructure. You should be comfortable debugging a Kubernetes deployment one hour and designing an LLM evaluation framework the next.

03 / 06

What You'll Do

// Reliability & Infrastructure

Keep AI up & fast

  • Design and maintain highly available AI systems running in production.
  • Build deployment, monitoring, and incident response processes for AI applications.
  • Improve system uptime, latency, scalability, and resilience.
  • Manage cloud infrastructure across AWS, GCP, or Azure.
  • Develop CI/CD pipelines for AI products and agent workflows.
  • Implement disaster recovery and rollback strategies.
// AI Evaluation & Quality

Measure everything

  • Design evaluation frameworks for LLM apps, agents, RAG systems, and AI workflows.
  • Create benchmarks for quality, accuracy, latency, cost, and user outcomes.
  • Build automated testing pipelines for prompts, tools, and agent behaviors.
  • Develop regression testing to detect quality degradation before deployment.
  • Monitor production AI performance and identify failure patterns.
  • Define service-level objectives (SLOs) for AI systems.
// Prompt Engineering & AI Ops

Operate the models

  • Design and optimize prompts for reliability and consistency.
  • Evaluate model performance across OpenAI, Anthropic, Gemini, and open-source models.
  • Build systems for prompt versioning and experimentation.
  • Create automated prompt evaluation and A/B testing frameworks.
  • Establish best practices for AI deployment and governance.
// Observability & Analytics

See the signals

  • Build dashboards and monitoring systems for AI applications.
  • Track hallucination rates, tool success, task completion, latency, tokens, and cost.
  • Design alerting for model drift, quality degradation, and infrastructure failures.
  • Analyze production data to improve AI system performance.
04 / 06

What We're Looking For

Strongly Preferred

  • 3+ years in SRE, Platform, DevOps, Infrastructure Engineering, or related roles.
  • Operating production systems with uptime and performance responsibilities.
  • Strong understanding of distributed systems.
  • Experience with Kubernetes and Docker.
  • Experience with Terraform or infrastructure-as-code tools.
  • Strong Python skills.
  • Building monitoring and observability systems.

AI Experience

  • Building or deploying LLM-powered applications.
  • Familiarity with prompt engineering techniques.
  • Eval frameworks: LangSmith, Braintrust, Weights & Biases, Arize, Patronus, or similar.
  • Understanding of RAG systems, AI agents, and tool calling.
  • Benchmarking AI models.
  • Systematically evaluating model performance rather than relying on intuition.

Bonus Points

  • Running AI workloads at scale.
  • Experience with vector databases.
  • ML infrastructure or MLOps.
  • Experience with agent frameworks.
  • Fine-tuning or model training.
  • Managing GPU infrastructure.
05 / 06

Your First 90 Days

By the end of your first three months, you will have:

01

Established baseline reliability metrics for our AI systems.

02

Implemented automated evaluation pipelines for core AI workflows.

03

Built monitoring dashboards covering quality, latency, and cost.

04

Defined AI-specific SLOs and incident response procedures.

05

Reduced production failures and improved deployment confidence.

06

Created repeatable benchmarks that let us measure progress objectively.

06 / 06

Traits We Value

Obsessive about measurement Skeptical of anecdotal evidence Strong systems thinker Owns production systems Pragmatic, not academic Moves fast without sacrificing reliability Monitor · test · measure everything
FAQ

Common Questions

What is the AI Software Engineer role at Boost Capital?

It's an AI Reliability Engineer role that owns the reliability, observability, evaluation, and production performance of Boost Capital's AI systems. You sit between AI Engineering and Infrastructure — making AI applications, agents, and workflows measurable, reliable, cost-efficient, and production-ready. It is neither a pure ML research role nor a traditional DevOps role.

What experience is required?

We strongly prefer 3+ years in Site Reliability, Platform, DevOps, or Infrastructure Engineering, strong Python, and experience with Kubernetes, Docker, and Terraform. AI experience matters too: building or deploying LLM-powered applications, prompt engineering, evaluation frameworks such as LangSmith, Braintrust, Weights & Biases, Arize, or Patronus, and understanding of RAG systems, AI agents, and tool calling.

Is the role remote?

Yes — it's a remote (telecommute) full-time role open to candidates in the United Kingdom, France, the Philippines, Vietnam, and Thailand.

What's the salary range?

USD 25,000–65,000 per year, based on experience.

How do I apply?

Submit your LinkedIn and GitHub using the application form below. You can also email carlo@boostcapital.io or connect on LinkedIn and message directly.

What does Boost Capital do?

Boost Capital builds the AI infrastructure powering digital onboarding for banks and microfinance institutions across Southeast Asia — enabling financial institutions to onboard clients for loans and savings in 5–10 minutes through chat, without app downloads, using chat-based lending, document AI, and agentic workflows. Boost integrates with a new partner bank in 2–3 weeks.

Apply // Two links is all we need

Send us
your work.

No cover letter, no long form. Drop your LinkedIn and GitHub and we'll take it from there. We read every application.

We only use these details to consider your application. By submitting, you consent to Boost Capital contacting you about this role.