Build the infrastructure that runs AI

We're hiring engineers who believe AI infrastructure should be sovereign, efficient, and built to outlast any single provider's roadmap.

$8M to Revolutionize AI Inference at the Edge

Read the coverage →

AI Application Engineer (Remote)

Build cross-platform AI applications that run locally on top of the OpenInfer Runtime Stack using C++ and Rust.

Research and develop acceleration techniques for transformer architectures, integrated into the PyTorch stack.

Build real-world AI applications with OpenInfer, create technical content, and grow the developer community.

Build secure, cross-platform assistant interfaces using React Native, integrating with OpenInfer's local AI runtime.

Build and maintain the Rust-based LLM inference engine, deployed across Windows, Linux, macOS, Android, and iOS.

Scale go-to-market for OpenInfer's inference infrastructure — enterprise sales, partnerships, and market positioning.

Lead partnership strategy, drive market adoption, and establish key collaborations across the AI inference ecosystem.

Build and optimize the inference engine for on-device AI — C/C++, low-level systems, power and performance.

Don't see your role? Reach out at recruiting@openinfer.io