Full Stack Engineer (Rust/TypeScript/Python)
AI Inference Platform

Lund, Stockholm, London, Remote

About Inceptron

Inceptron is building a next generation AI inference platform powered by a deep, proprietary compiler stack. We help customers run AI workloads with lower latency, higher throughput, and better cost efficiency across GPUs, FPGAs, and edge hardware — allowing teams to scale production AI without scaling the bill.

The Role

Join our small, senior engineering team to build the product experiences that put our low-level performance engine in customers’ hands. You’ll work across the stack, from UI and APIs to billing, releases, and secure delivery images.

What you’ll do

  • Design and ship end-to-end features for our web console and API (onboarding, model upload/verification, optimization workflows, performance dashboards, A/B routing).

  • Build robust backend services with strong concurrency patterns (e.g. async job orchestration, streaming metrics, rate/usage metering).

  • Own ecommerce flows (usage metering, subscriptions, invoicing/entitlements) and integrate them cleanly with our platform.

  • Harden our Linux-based delivery artifacts (e.g. AMIs or comparable images) and own release engineering for secure, reproducible builds.

  • Maintain CI/CD pipelines (tests, artifacts, Docker images) and partner with DevOps on infrastructure-as-code (Kubernetes experience is welcome but not required).

  • Collaborate daily with compiler, systems, and ML engineers to expose advanced optimization controls safely and ergonomically to users.


What we’re looking for

  • Experienced programmers who have solved problems across multiple domains. You think in threads, queues, and backpressure as comfortably as in components and hooks.

  • Hands-on experience with system images: AWS AMIs or equivalent on other clouds (image hardening, baking, reproducible builds).

  • Solid web fundamentals: you’ve built and owned production web apps end to end.

  • Depth in at least one of Rust, TypeScript, or Python, with curiosity to learn the others.

  • Interest in problem domains such as web development, ecommerce, Linux security, and CI/CD pipelines.

  • Evidence of ownership: show us something you’ve architected end to end (even a personal project) where you chose the database and shipped it.

  • Collaboration skills: comfortable working in a tight, code review–driven team and communicating clearly.


Nice to have

  • Experience with Kubernetes in production, infrastructure-as-code, or observability stacks.

  • Exposure to compilers, TVM, GPU/accelerator runtimes, or high-performance/concurrent systems.

What you’ll get

  • Work at the performance frontier: help turn cutting-edge compiler and runtime tech into a delightful developer product used to speed up and scale AI in production.

  • Broad surface area: ship features that span product UX, secure delivery (AMIs/containers), and platform economics.

  • Impact and autonomy: join a small, senior team where your code will be in customers’ hands quickly.


How to Apply

Apply below with your résumé. Please include:

  • A short note on a system or app you owned end-to-end (link to repo/demo; tell us what you optimized).

  • Links to your GitHub/portfolio and any production systems you can discuss.

Inceptron is an equal opportunity employer. We value inclusive teams and welcome applicants from all backgrounds.

About Inceptron

Inceptron is building a next generation AI inference platform powered by a deep, proprietary compiler stack. We help customers run AI workloads with lower latency, higher throughput, and better cost efficiency across GPUs, FPGAs, and edge hardware — allowing teams to scale production AI without scaling the bill.

The Role

Join our small, senior engineering team to build the product experiences that put our low-level performance engine in customers’ hands. You’ll work across the stack, from UI and APIs to billing, releases, and secure delivery images.

What you’ll do

  • Design and ship end-to-end features for our web console and API (onboarding, model upload/verification, optimization workflows, performance dashboards, A/B routing).

  • Build robust backend services with strong concurrency patterns (e.g. async job orchestration, streaming metrics, rate/usage metering).

  • Own ecommerce flows (usage metering, subscriptions, invoicing/entitlements) and integrate them cleanly with our platform.

  • Harden our Linux-based delivery artifacts (e.g. AMIs or comparable images) and own release engineering for secure, reproducible builds.

  • Maintain CI/CD pipelines (tests, artifacts, Docker images) and partner with DevOps on infrastructure-as-code (Kubernetes experience is welcome but not required).

  • Collaborate daily with compiler, systems, and ML engineers to expose advanced optimization controls safely and ergonomically to users.


What we’re looking for

  • Experienced programmers who have solved problems across multiple domains. You think in threads, queues, and backpressure as comfortably as in components and hooks.

  • Hands-on experience with system images: AWS AMIs or equivalent on other clouds (image hardening, baking, reproducible builds).

  • Solid web fundamentals: you’ve built and owned production web apps end to end.

  • Depth in at least one of Rust, TypeScript, or Python, with curiosity to learn the others.

  • Interest in problem domains such as web development, ecommerce, Linux security, and CI/CD pipelines.

  • Evidence of ownership: show us something you’ve architected end to end (even a personal project) where you chose the database and shipped it.

  • Collaboration skills: comfortable working in a tight, code review–driven team and communicating clearly.


Nice to have

  • Experience with Kubernetes in production, infrastructure-as-code, or observability stacks.

  • Exposure to compilers, TVM, GPU/accelerator runtimes, or high-performance/concurrent systems.

What you’ll get

  • Work at the performance frontier: help turn cutting-edge compiler and runtime tech into a delightful developer product used to speed up and scale AI in production.

  • Broad surface area: ship features that span product UX, secure delivery (AMIs/containers), and platform economics.

  • Impact and autonomy: join a small, senior team where your code will be in customers’ hands quickly.


How to Apply

Apply below with your résumé. Please include:

  • A short note on a system or app you owned end-to-end (link to repo/demo; tell us what you optimized).

  • Links to your GitHub/portfolio and any production systems you can discuss.

Inceptron is an equal opportunity employer. We value inclusive teams and welcome applicants from all backgrounds.

About Inceptron

Inceptron is building a next generation AI inference platform powered by a deep, proprietary compiler stack. We help customers run AI workloads with lower latency, higher throughput, and better cost efficiency across GPUs, FPGAs, and edge hardware — allowing teams to scale production AI without scaling the bill.

The Role

Join our small, senior engineering team to build the product experiences that put our low-level performance engine in customers’ hands. You’ll work across the stack, from UI and APIs to billing, releases, and secure delivery images.

What you’ll do

  • Design and ship end-to-end features for our web console and API (onboarding, model upload/verification, optimization workflows, performance dashboards, A/B routing).

  • Build robust backend services with strong concurrency patterns (e.g. async job orchestration, streaming metrics, rate/usage metering).

  • Own ecommerce flows (usage metering, subscriptions, invoicing/entitlements) and integrate them cleanly with our platform.

  • Harden our Linux-based delivery artifacts (e.g. AMIs or comparable images) and own release engineering for secure, reproducible builds.

  • Maintain CI/CD pipelines (tests, artifacts, Docker images) and partner with DevOps on infrastructure-as-code (Kubernetes experience is welcome but not required).

  • Collaborate daily with compiler, systems, and ML engineers to expose advanced optimization controls safely and ergonomically to users.


What we’re looking for

  • Experienced programmers who have solved problems across multiple domains. You think in threads, queues, and backpressure as comfortably as in components and hooks.

  • Hands-on experience with system images: AWS AMIs or equivalent on other clouds (image hardening, baking, reproducible builds).

  • Solid web fundamentals: you’ve built and owned production web apps end to end.

  • Depth in at least one of Rust, TypeScript, or Python, with curiosity to learn the others.

  • Interest in problem domains such as web development, ecommerce, Linux security, and CI/CD pipelines.

  • Evidence of ownership: show us something you’ve architected end to end (even a personal project) where you chose the database and shipped it.

  • Collaboration skills: comfortable working in a tight, code review–driven team and communicating clearly.


Nice to have

  • Experience with Kubernetes in production, infrastructure-as-code, or observability stacks.

  • Exposure to compilers, TVM, GPU/accelerator runtimes, or high-performance/concurrent systems.

What you’ll get

  • Work at the performance frontier: help turn cutting-edge compiler and runtime tech into a delightful developer product used to speed up and scale AI in production.

  • Broad surface area: ship features that span product UX, secure delivery (AMIs/containers), and platform economics.

  • Impact and autonomy: join a small, senior team where your code will be in customers’ hands quickly.


How to Apply

Apply below with your résumé. Please include:

  • A short note on a system or app you owned end-to-end (link to repo/demo; tell us what you optimized).

  • Links to your GitHub/portfolio and any production systems you can discuss.

Inceptron is an equal opportunity employer. We value inclusive teams and welcome applicants from all backgrounds.

Next generation
AI compute optimization

© Inceptron 2025

Next generation
AI compute optimization

© Inceptron 2025

Next generation
AI compute optimization

© Inceptron 2025