Build the infrastructure that runs AI

We're hiring engineers who believe AI infrastructure should be sovereign, efficient, and built to outlast any single provider's roadmap.

$8M to Revolutionize AI Inference at the Edge

Read the coverage →

AI Application Engineer (Remote)

Build cross-platform AI applications that run locally on top of the OpenInfer Runtime Stack using C++ and Rust.

Remote · Contract

AI Model Architecture Optimization Engineer R&D

Research and develop acceleration techniques for transformer architectures, integrated into the PyTorch stack.

San Mateo, CA · Full-time

Developer Relations and Community Engineer

Build real-world AI applications with OpenInfer, create technical content, and grow the developer community.

Bay Area, CA · Full-time

Full Stack Engineer – Applications (React Native / Browser / Local AI)

Build secure, cross-platform assistant interfaces using React Native, integrating with OpenInfer's local AI runtime.

Bay Area, CA · Full-time

Full-Time Engineer: AI Software Engineer

Build and maintain the Rust-based LLM inference engine, deployed across Windows, Linux, macOS, Android, and iOS.

Bay Area, CA · Full-time

GTM & Strategic Partnerships

Scale go-to-market for OpenInfer's inference infrastructure — enterprise sales, partnerships, and market positioning.

San Francisco Bay Area (Onsite Only) · Full-time

Head of Business Development

Lead partnership strategy, drive market adoption, and establish key collaborations across the AI inference ecosystem.

San Mateo, CA (Hybrid) · Full-time

Senior On-Device AI Inference Performance Engineer

Build and optimize the inference engine for on-device AI — C/C++, low-level systems, power and performance.

San Mateo, CA · Full-time

Don't see your role? Reach out at recruiting@openinfer.io