Job Description
Are you ready to define the technological landscape of 2026? Nexus Future Systems is seeking a world-class Generative AI Architect to architect the next generation of intelligent systems.
As we stand on the precipice of a new era in artificial intelligence, our team is dedicated to pushing the boundaries of what is possible. You will be responsible for building scalable, efficient, and ethically sound AI models that power our core products. This is a rare opportunity to lead high-impact projects in a fast-growing, innovative environment.
Why Join Us?
We offer a competitive benefits package, including equity options, comprehensive health coverage, and a flexible remote-first work culture. You will work with cutting-edge technology and collaborate with the brightest minds in the industry.
What You Will Do
Your mission is to build the future of AI. Key responsibilities include:
- Architect and deploy large-scale generative models (LLMs) for production environments.
- Optimize model inference performance and reduce latency using advanced hardware acceleration.
- Design novel prompt engineering strategies and Retrieval-Augmented Generation (RAG) pipelines.
- Collaborate with product managers to translate complex AI concepts into user-friendly features.
- Ensure data privacy, security, and compliance with industry standards.
- Mentor junior engineers and foster a culture of continuous learning and innovation.
Who You Are
We are looking for a visionary leader with a passion for deep learning. You possess a deep understanding of the mathematical foundations of AI and the practical skills to bring models to life.
- 5+ years of experience in machine learning engineering or software development.
- Proficiency in Python, PyTorch, and TensorFlow.
- Deep understanding of Natural Language Processing (NLP) and transformer architectures.
- Experience with cloud platforms (AWS, GCP, or Azure) and containerization (Docker/Kubernetes).
- Strong mathematical background in linear algebra and statistics.
- Excellent communication skills and ability to thrive in a fast-paced, remote-first environment.
Responsibilities
- Architect and deploy large-scale generative models (LLMs) for production environments.
- Optimize model inference performance and reduce latency using advanced hardware acceleration.
- Design novel prompt engineering strategies and Retrieval-Augmented Generation (RAG) pipelines.
- Collaborate with product managers to translate complex AI concepts into user-friendly features.
- Ensure data privacy, security, and compliance with industry standards.
- Mentor junior engineers and foster a culture of continuous learning and innovation.
Qualifications
- 5+ years of experience in machine learning engineering or software development.
- Proficiency in Python, PyTorch, and TensorFlow.
- Deep understanding of Natural Language Processing (NLP) and transformer architectures.
- Experience with cloud platforms (AWS, GCP, or Azure) and containerization (Docker/Kubernetes).
- Strong mathematical background in linear algebra and statistics.
- Excellent communication skills and ability to thrive in a fast-paced, remote-first environment.