Home Job Details
N
Information Technology 🏢 Full Time ⭐️ Verified

Senior AI Infrastructure Engineer

Nexus Horizon
San Francisco
Estimated Salary
USD 180.000 – USD 260.000
Live Update
17 Mei 2026
Deadline
17 Mei 2027

Job Description

We are building the technological backbone for the year 2026. Nexus Horizon is seeking a visionary Senior AI Infrastructure Engineer to design, deploy, and scale cutting-edge machine learning systems. In this pivotal role, you will bridge the gap between theoretical AI advancements and robust, production-ready infrastructure.

Your expertise will directly shape our roadmap, ensuring our platforms can handle the computational demands of next-generation artificial intelligence. If you are passionate about the future of tech and want to work at the forefront of innovation, we want to hear from you.

Why Join Us?

  • Work on projects that define the future of AI.
  • Competitive compensation and equity packages.
  • Flexible remote and hybrid work options.
  • Access to the latest hardware and research tools.

Responsibilities

  • Design and implement scalable distributed systems for training and deploying large language models (LLMs).
  • Optimize training pipelines and inference engines to reduce latency and improve throughput.
  • Collaborate with data science teams to translate research into production-grade code.
  • Ensure high availability, fault tolerance, and security of all AI infrastructure components.
  • Research and evaluate new hardware accelerators and cloud services to stay ahead of the 2026 tech curve.
  • Mentor junior engineers and establish best practices for AI engineering within the organization.

Qualifications

  • 5+ years of experience in software engineering, with at least 2 years specializing in AI/ML infrastructure.
  • Deep understanding of machine learning frameworks such as PyTorch, TensorFlow, or JAX.
  • Proficiency in cloud platforms (AWS, GCP, or Azure) and containerization technologies (Docker, Kubernetes).
  • Experience with high-performance computing clusters and GPU orchestration.
  • Strong programming skills in Python and C++.
  • Excellent problem-solving skills and the ability to work in a fast-paced, agile environment.

Required Skills

Python PyTorch TensorFlow Kubernetes Docker AWS GCP ML Ops Distributed Systems Machine Learning CUDA High Performance Computing

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.

Apply Now

Related Jobs

Similar job recommendations for you

View All