Member of Technical Staff, Inference

Radical Numerics
Radical Numerics

IT

San Francisco, CA, USA

Posted on Jun 15, 2026

About Us


Radical Numerics is an AI research lab building general biological intelligence. Our mission is to master the code of life, and our purpose is to reduce human suffering.

Our team created Evo, and started the field of generative genomics. Our work was featured on the cover of Science, and presented by our CEO on the main stage of TED2025. Evo was used to create the first AI gene therapy tool CRISPR-Cas9, and the first AI whole genome from scratch. Evo 2, featured in Nature, is the largest fully open source AI project across any domain.

Radical Numerics is bringing the rigor of distributed systems, model architecture, and numerics research to the challenges of biology. We’ve redesigned the foundation model training stack to turn the world’s raw scientific data (e.g. biological sequences, experiments, and physical processes), into intelligible, generative models that can expand and accelerate what humanity can understand, design, and cure.

The same generative breakthroughs that enable life-saving cures also lowers the barrier to creating engineered threats and AI-generated bioweapons. We believe these forces are inseparable. Radical Numerics was founded to develop both the power to design and the responsibility to defend.

About the Role

As a Member of Technical Staff, Inference at Radical Numerics, you will build and optimize the systems that bring frontier biological AI models into production. Your work will focus on delivering state-of-the-art inference performance for large-scale genome and multimodal biological models across a wide range of real-world applications, including therapeutics, diagnostics, synthetic biology, and biodefense.

This is a highly technical role at the intersection of AI systems, distributed computing, and model deployment. You will work closely with research, infrastructure, and external partners to ensure our models can be efficiently deployed, scaled, and integrated into production environments. Success in this role requires deep expertise in large language model inference, kernel optimization, GPU systems, and performance engineering.

You should be excited by questions such as: How do we reduce inference latency for 100B MoE models? How do we maximize throughput across heterogeneous hardware environments? How do we optimize custom kernels for emerging hybrid model architectures? How do we deploy foundation models reliably across cloud, on-premise, and highly regulated environments? How do we enable our partners to transform biological research and development through production-grade AI systems?

What You'll Do

Drive end-to-end performance improvements. Identify and eliminate bottlenecks across the inference stack, from model execution and memory management to networking, scheduling, and hardware utilization.

Develop high-performance inference primitives. Build and optimize GPU kernels, numerical operators, and serving infrastructure to maximize throughput, latency, and efficiency on modern accelerator platforms.

Partner with external customers and collaborators. Work directly with pharmaceutical companies, biotech organizations, research institutions, and government partners to deploy models in production environments and solve challenging technical problems.

Build scalable deployment infrastructure. Create systems for serving, monitoring, benchmarking, and operating foundation models reliably across cloud, enterprise, and secure environments.

Collaborate with research and platform teams. Ensure new model architectures can be efficiently deployed at scale and help translate frontier AI research into real-world impact.

What We're Looking For

Expertise in large-scale AI inference systems. Proven experience optimizing, deploying, and operating LLMs or other foundation models in production environments.

Strong performance engineering and kernel development skills. Deep understanding of GPU architectures and experience with CUDA, Triton, or equivalent technologies for building high-performance numerical software.

Systems-level thinking. Ability to diagnose and solve bottlenecks across the full stack, including model architectures, serving systems, networking, memory management, and distributed infrastructure.

Hands-on builder. Strong software engineering fundamentals with proficiency in Python and modern ML frameworks such as PyTorch.

Customer and deployment orientation. Experience working closely with users, customers, or cross-functional stakeholders to deliver production AI systems that solve real-world problems.

Excellent technical communication. Ability to collaborate effectively across research, engineering, infrastructure, and scientific teams.

Nice to Have

  • Experience with inference frameworks such as vLLM, TensorRT-LLM, SGLang, DeepSpeed, or similar systems.

  • Contributions to open-source AI infrastructure, inference frameworks, compilers, or kernel libraries.

  • Experience with distributed systems, cloud infrastructure, and large-scale GPU clusters.

  • Familiarity with biological foundation models, computational biology, genomics, or scientific AI applications.

  • Experience operating AI systems in regulated, secure, or mission-critical environments

Why Radical Numerics

Help build the infrastructure that powers the next generation of biological AI models and deploy them into some of the world's most important scientific and healthcare applications.

Work on some of the largest and most capable open biological AI models, helping transform breakthroughs in AI research into real-world impact across therapeutics, diagnostics, synthetic biology, and biodefense.

Join a team that brings together expertise in distributed systems, model architecture, numerics, AI safety, and biology.

Collaborate with leading researchers across AI labs, biotechs, pharmaceutical companies, hospital systems, government programs, and scientific institutions.

Radical Numerics is committed to equal employment opportunity and does not discriminate in any employment opportunities or practices based on an individual's race, color, creed, gender (including gender identity and gender expression), religion (all aspects of religious beliefs, observance or practice, including religious dress or grooming practices), marital status, registered domestic partner status, age, national origin or ancestry (including language use restrictions and possession of a driver’s license issued under California Vehicle Code section 12801.9), natural hair, physical or mental disability, political affiliation, medical condition (including cancer or a record or history of cancer, and genetic characteristics), sex (including pregnancy, childbirth, breastfeeding or related medical condition), genetic information, sexual orientation, military and veteran status or any other consideration made unlawful by federal, state, or local laws. It also prohibits unlawful discrimination based on the perception that anyone has any of those characteristics, or is associated with a person who has or is perceived as having any of those characteristics.