Job Description
We are on a mission to revolutionize enterprise intelligence by deploying next-generation Large Language Models. As the industry moves into 2026, we are seeking a visionary Senior AI/LLM Architect to lead our infrastructure and model strategy.
In this role, you will bridge the gap between cutting-edge research and scalable production systems. You will design the neural architectures that power our proprietary AI agents, ensuring they are not only intelligent but also efficient, secure, and explainable. If you thrive in a fast-paced, high-stakes environment and want to help set the future standard for AI, we want to meet you.
Responsibilities
- Architect LLM Solutions: Design and implement robust, scalable architectures for Large Language Models and Generative AI applications.
- Model Optimization: Fine-tune and optimize models for inference speed, memory efficiency, and cost reduction on cloud infrastructure.
- Infrastructure Scaling: Oversee the deployment of AI models across distributed systems, ensuring 99.99% uptime and low latency.
- Research & Innovation: Track the latest advances in AI research (transformers, diffusion models, reinforcement learning) and integrate them into our product roadmap.
- Collaboration: Partner with data scientists, software engineers, and product managers to translate technical requirements into architectural blueprints.
- Security & Ethics: Implement guardrails and safety protocols to reduce bias in AI outputs and keep them secure and compliant with regulatory standards.
Qualifications
- Education: Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, or a related technical field.
- Experience: 5+ years of professional experience in AI/ML engineering, with at least 2 years specifically focused on LLMs or NLP.
- Technical Skills: Proficiency in Python and at least one deep learning framework (PyTorch, TensorFlow, or JAX). Deep understanding of transformer architectures and attention mechanisms.
- Deployment: Strong experience with cloud platforms (AWS, GCP, or Azure) and containerization tools (Docker, Kubernetes).
- System Design: Proven ability to design distributed systems capable of handling high-throughput data streams.
- Soft Skills: Exceptional communication skills with the ability to explain complex technical concepts to non-technical stakeholders.