Home Job Details
N
Information Technology 🏢 Full Time ⭐️ Verified

Generative AI Architect

Nexus Future Systems
San Francisco
Estimated Salary
USD 160.000 – USD 230.000
New
Live Update
21 Mei 2026
Deadline
21 Mei 2027

Job Description

Are you ready to define the technological landscape of 2026? Nexus Future Systems is seeking a world-class Generative AI Architect to architect the next generation of intelligent systems.

As we stand on the precipice of a new era in artificial intelligence, our team is dedicated to pushing the boundaries of what is possible. You will be responsible for building scalable, efficient, and ethically sound AI models that power our core products. This is a rare opportunity to lead high-impact projects in a fast-growing, innovative environment.

Why Join Us?
We offer a competitive benefits package, including equity options, comprehensive health coverage, and a flexible remote-first work culture. You will work with cutting-edge technology and collaborate with the brightest minds in the industry.

What You Will Do

Your mission is to build the future of AI. Key responsibilities include:

  • Architect and deploy large-scale generative models (LLMs) for production environments.
  • Optimize model inference performance and reduce latency using advanced hardware acceleration.
  • Design novel prompt engineering strategies and Retrieval-Augmented Generation (RAG) pipelines.
  • Collaborate with product managers to translate complex AI concepts into user-friendly features.
  • Ensure data privacy, security, and compliance with industry standards.
  • Mentor junior engineers and foster a culture of continuous learning and innovation.

Who You Are

We are looking for a visionary leader with a passion for deep learning. You possess a deep understanding of the mathematical foundations of AI and the practical skills to bring models to life.

  • 5+ years of experience in machine learning engineering or software development.
  • Proficiency in Python, PyTorch, and TensorFlow.
  • Deep understanding of Natural Language Processing (NLP) and transformer architectures.
  • Experience with cloud platforms (AWS, GCP, or Azure) and containerization (Docker/Kubernetes).
  • Strong mathematical background in linear algebra and statistics.
  • Excellent communication skills and ability to thrive in a fast-paced, remote-first environment.

Responsibilities

  • Architect and deploy large-scale generative models (LLMs) for production environments.
  • Optimize model inference performance and reduce latency using advanced hardware acceleration.
  • Design novel prompt engineering strategies and Retrieval-Augmented Generation (RAG) pipelines.
  • Collaborate with product managers to translate complex AI concepts into user-friendly features.
  • Ensure data privacy, security, and compliance with industry standards.
  • Mentor junior engineers and foster a culture of continuous learning and innovation.

Qualifications

  • 5+ years of experience in machine learning engineering or software development.
  • Proficiency in Python, PyTorch, and TensorFlow.
  • Deep understanding of Natural Language Processing (NLP) and transformer architectures.
  • Experience with cloud platforms (AWS, GCP, or Azure) and containerization (Docker/Kubernetes).
  • Strong mathematical background in linear algebra and statistics.
  • Excellent communication skills and ability to thrive in a fast-paced, remote-first environment.

Required Skills

Python PyTorch TensorFlow LLMs NLP Machine Learning Cloud Computing Docker Kubernetes AWS GCP

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.

Apply Now

Related Jobs

Similar job recommendations for you

View All