Job Description
We are looking for a visionary Senior AI Research Engineer to lead development of the next generation of reasoning models. Nexus Intelligence is pioneering breakthroughs in the OpenAI o1 architecture and Chain-of-Thought inference. Join a team of elite researchers pushing the boundaries of what's possible in artificial general intelligence.
In this role, you will bridge the gap between cutting-edge model capabilities and practical enterprise applications, optimizing performance for complex problem-solving tasks.
Why Join Us?
- Work with the latest OpenAI o1 and LLM technologies.
- Competitive compensation package in the heart of San Francisco.
- Flexible remote-first policy with occasional in-office collaboration.
- Access to top-tier compute resources and research tools.
Responsibilities
- Model Evaluation & Optimization: Rigorously evaluate and fine-tune OpenAI o1-preview and o1-mini models to enhance reasoning accuracy and efficiency.
- Architecture Design: Design and implement novel architectures for post-training and distillation of reasoning models.
- Data Curation: Curate high-quality datasets designed to elicit and improve Chain-of-Thought reasoning.
- System Integration: Integrate advanced AI models into scalable production pipelines, ensuring low-latency inference.
- R&D Leadership: Mentor junior researchers and contribute to the company's patent portfolio in generative reasoning.
- Performance Tuning: Optimize model latency and token costs while maintaining high output fidelity.
Qualifications
- Education: M.S. or Ph.D. in Computer Science, Machine Learning, or a related quantitative field.
- Experience: 5+ years in deep learning, NLP, or reinforcement learning.
- Technical Skills: Proficiency in Python, PyTorch, and TensorFlow.
- Model Expertise: Deep understanding of Large Language Models (LLMs) and Transformers, particularly the capabilities of OpenAI o1.
- Problem Solving: Proven track record of solving complex technical challenges in AI research.
- Communication: Ability to explain complex technical concepts to both technical and non-technical stakeholders.