Job Description
The Future of Computing Starts Here.
Nexus Horizons Inc. is pioneering the frontier of Artificial General Intelligence (AGI) infrastructure. We are seeking a visionary AI/ML Infrastructure Engineer to build the backbone of next-generation neural networks. If you thrive in high-performance computing environments and are obsessed with scalability and efficiency, we want you on our team.
As we accelerate towards a fully autonomous future, your work will directly impact how billions of interactions are processed in real-time. You will bridge the gap between complex algorithms and robust, fault-tolerant systems.
Why Join Us?
- Work with cutting-edge technology that defines the 2026 roadmap.
- Competitive compensation and equity packages.
- Flexible remote-first culture with HQ in the heart of the tech hub.
Responsibilities
- Architect and maintain high-throughput GPU clusters for training large-scale transformer models.
- Implement and optimize MLOps pipelines using Kubernetes and Terraform for CI/CD.
- Ensure system reliability and performance under heavy load through proactive monitoring and auto-scaling.
- Collaborate with data scientists to optimize data pipelines for ingestion and preprocessing.
- Research and integrate emerging hardware acceleration technologies (e.g., NVLink, InfiniBand).
Qualifications
- 5+ years of experience in DevOps, SRE, or Infrastructure Engineering.
- Deep proficiency in Python, Go, or Rust.
- Strong expertise in containerization (Docker, Podman) and orchestration (Kubernetes).
- Experience with cloud platforms, specifically AWS or Google Cloud Platform (GCP).
- Understanding of distributed systems, load balancing, and caching strategies.