Full Stack Engineer (Rust/TypeScript/Python)
AI Inference Platform
Lund, Stockholm, London, Remote
About Inceptron
Inceptron is building a next generation AI inference platform powered by a deep, proprietary compiler stack. We help customers run AI workloads with lower latency, higher throughput, and better cost efficiency across GPUs, FPGAs, and edge hardware — allowing teams to scale production AI without scaling the bill.
The Role
Join our small, senior engineering team to build the product experiences that put our low-level performance engine in customers’ hands. You’ll work across the stack, from UI and APIs to billing, releases, and secure delivery images.
What you’ll do
Design and ship end-to-end features for our web console and API (onboarding, model upload/verification, optimization workflows, performance dashboards, A/B routing).
Build robust backend services with strong concurrency patterns (e.g. async job orchestration, streaming metrics, rate/usage metering).
Own ecommerce flows (usage metering, subscriptions, invoicing/entitlements) and integrate them cleanly with our platform.
Harden our Linux-based delivery artifacts (e.g. AMIs or comparable images) and own release engineering for secure, reproducible builds.
Maintain CI/CD pipelines (tests, artifacts, Docker images) and partner with DevOps on infrastructure-as-code (Kubernetes experience is welcome but not required).
Collaborate daily with compiler, systems, and ML engineers to expose advanced optimization controls safely and ergonomically to users.
What we’re looking for
Experienced programmers who have solved problems across multiple domains. You think in threads, queues, and backpressure as comfortably as in components and hooks.
Hands-on experience with system images: AWS AMIs or equivalent on other clouds (image hardening, baking, reproducible builds).
Solid web fundamentals: you’ve built and owned production web apps end to end.
Depth in at least one of Rust, TypeScript, or Python, with curiosity to learn the others.
Interest in problem domains such as web development, ecommerce, Linux security, and CI/CD pipelines.
Evidence of ownership: show us something you’ve architected end to end (even a personal project) where you chose the database and shipped it.
Collaboration skills: comfortable working in a tight, code review–driven team and communicating clearly.
Nice to have
Experience with Kubernetes in production, infrastructure-as-code, or observability stacks.
Exposure to compilers, TVM, GPU/accelerator runtimes, or high-performance/concurrent systems.
What you’ll get
Work at the performance frontier: help turn cutting-edge compiler and runtime tech into a delightful developer product used to speed up and scale AI in production.
Broad surface area: ship features that span product UX, secure delivery (AMIs/containers), and platform economics.
Impact and autonomy: join a small, senior team where your code will be in customers’ hands quickly.
How to Apply
Apply below with your résumé. Please include:
A short note on a system or app you owned end-to-end (link to repo/demo; tell us what you optimized).
Links to your GitHub/portfolio and any production systems you can discuss.
Inceptron is an equal opportunity employer. We value inclusive teams and welcome applicants from all backgrounds.
About Inceptron
Inceptron is building a next generation AI inference platform powered by a deep, proprietary compiler stack. We help customers run AI workloads with lower latency, higher throughput, and better cost efficiency across GPUs, FPGAs, and edge hardware — allowing teams to scale production AI without scaling the bill.
The Role
Join our small, senior engineering team to build the product experiences that put our low-level performance engine in customers’ hands. You’ll work across the stack, from UI and APIs to billing, releases, and secure delivery images.
What you’ll do
Design and ship end-to-end features for our web console and API (onboarding, model upload/verification, optimization workflows, performance dashboards, A/B routing).
Build robust backend services with strong concurrency patterns (e.g. async job orchestration, streaming metrics, rate/usage metering).
Own ecommerce flows (usage metering, subscriptions, invoicing/entitlements) and integrate them cleanly with our platform.
Harden our Linux-based delivery artifacts (e.g. AMIs or comparable images) and own release engineering for secure, reproducible builds.
Maintain CI/CD pipelines (tests, artifacts, Docker images) and partner with DevOps on infrastructure-as-code (Kubernetes experience is welcome but not required).
Collaborate daily with compiler, systems, and ML engineers to expose advanced optimization controls safely and ergonomically to users.
What we’re looking for
Experienced programmers who have solved problems across multiple domains. You think in threads, queues, and backpressure as comfortably as in components and hooks.
Hands-on experience with system images: AWS AMIs or equivalent on other clouds (image hardening, baking, reproducible builds).
Solid web fundamentals: you’ve built and owned production web apps end to end.
Depth in at least one of Rust, TypeScript, or Python, with curiosity to learn the others.
Interest in problem domains such as web development, ecommerce, Linux security, and CI/CD pipelines.
Evidence of ownership: show us something you’ve architected end to end (even a personal project) where you chose the database and shipped it.
Collaboration skills: comfortable working in a tight, code review–driven team and communicating clearly.
Nice to have
Experience with Kubernetes in production, infrastructure-as-code, or observability stacks.
Exposure to compilers, TVM, GPU/accelerator runtimes, or high-performance/concurrent systems.
What you’ll get
Work at the performance frontier: help turn cutting-edge compiler and runtime tech into a delightful developer product used to speed up and scale AI in production.
Broad surface area: ship features that span product UX, secure delivery (AMIs/containers), and platform economics.
Impact and autonomy: join a small, senior team where your code will be in customers’ hands quickly.
How to Apply
Apply below with your résumé. Please include:
A short note on a system or app you owned end-to-end (link to repo/demo; tell us what you optimized).
Links to your GitHub/portfolio and any production systems you can discuss.
Inceptron is an equal opportunity employer. We value inclusive teams and welcome applicants from all backgrounds.
About Inceptron
Inceptron is building a next generation AI inference platform powered by a deep, proprietary compiler stack. We help customers run AI workloads with lower latency, higher throughput, and better cost efficiency across GPUs, FPGAs, and edge hardware — allowing teams to scale production AI without scaling the bill.
The Role
Join our small, senior engineering team to build the product experiences that put our low-level performance engine in customers’ hands. You’ll work across the stack, from UI and APIs to billing, releases, and secure delivery images.
What you’ll do
Design and ship end-to-end features for our web console and API (onboarding, model upload/verification, optimization workflows, performance dashboards, A/B routing).
Build robust backend services with strong concurrency patterns (e.g. async job orchestration, streaming metrics, rate/usage metering).
Own ecommerce flows (usage metering, subscriptions, invoicing/entitlements) and integrate them cleanly with our platform.
Harden our Linux-based delivery artifacts (e.g. AMIs or comparable images) and own release engineering for secure, reproducible builds.
Maintain CI/CD pipelines (tests, artifacts, Docker images) and partner with DevOps on infrastructure-as-code (Kubernetes experience is welcome but not required).
Collaborate daily with compiler, systems, and ML engineers to expose advanced optimization controls safely and ergonomically to users.
What we’re looking for
Experienced programmers who have solved problems across multiple domains. You think in threads, queues, and backpressure as comfortably as in components and hooks.
Hands-on experience with system images: AWS AMIs or equivalent on other clouds (image hardening, baking, reproducible builds).
Solid web fundamentals: you’ve built and owned production web apps end to end.
Depth in at least one of Rust, TypeScript, or Python, with curiosity to learn the others.
Interest in problem domains such as web development, ecommerce, Linux security, and CI/CD pipelines.
Evidence of ownership: show us something you’ve architected end to end (even a personal project) where you chose the database and shipped it.
Collaboration skills: comfortable working in a tight, code review–driven team and communicating clearly.
Nice to have
Experience with Kubernetes in production, infrastructure-as-code, or observability stacks.
Exposure to compilers, TVM, GPU/accelerator runtimes, or high-performance/concurrent systems.
What you’ll get
Work at the performance frontier: help turn cutting-edge compiler and runtime tech into a delightful developer product used to speed up and scale AI in production.
Broad surface area: ship features that span product UX, secure delivery (AMIs/containers), and platform economics.
Impact and autonomy: join a small, senior team where your code will be in customers’ hands quickly.
How to Apply
Apply below with your résumé. Please include:
A short note on a system or app you owned end-to-end (link to repo/demo; tell us what you optimized).
Links to your GitHub/portfolio and any production systems you can discuss.