1x Product + 1x ML Staff Engineer
VLM Run (https://vlm.run)
Machine LearningRemoteLeadvia Hacker News
About the Role
VLM Run (vlm.run) | 1x Product + 1x ML Staff Engineer | Santa Clara, CA (HQ)
We're building the inference and orchestration layer for production Vision-Language Models. We care deeply about fast and ergonomic visual inference, reliable structured outputs, and the observability to iterate on them.
A few things we've shipped recently you can poke at:
- Orion: our visual agent that reasons and acts over images, video, and documents. Chat at chat.vlm.run.
- mm-ctx: a Unix-style multimodal CLI (find, cat, grep, wc) that gives coding agents real context over images, video, and PDFs. Rust core, Python devex.
- vlmbench: single-file CLI for benchmarking VLM inference (TTFT, TPOT, throughput) across vLLM, Ollama, and SGLang.
Apply: app.dover.com/jobs/vlm-run
Email hiring "at" vlm.run with your GitHub + a couple recent projects.
[1] chat.vlm.run
[2] pypi.org/project/mm-ctx | www.vlm.run/open-source/mm
[3] github.com/vlm-run/vlmbench | www.vlm.run/open-source/vlmbench
Interested in this role?
Sign in to apply with your profile and CV.