Job Description
We are building the technology stack for the year 2026. At FutureScale Inc., we are pioneering next-generation Agentic AI systems designed to revolutionize enterprise operations. We are seeking a visionary Senior AI Engineer to join our elite technical team and help define the future of autonomous intelligence.
In this role, you will not just be writing code; you will be architecting the brain of our next-generation platform. You will work directly with our CTO to push the boundaries of LLM orchestration, fine-tuning, and scalable inference engines.
Why Join Us?
- Work on cutting-edge projects that define the industry standard for 2026.
- Competitive compensation and equity package.
- Remote-first culture with access to world-class tools.
- Collaborate with top-tier talent in a fast-paced environment.
Responsibilities
- Architect Agentic AI Systems: Design and implement complex multi-agent workflows that can autonomously solve complex business problems.
- Model Optimization: Fine-tune and optimize large language models (LLMs) for specific domain tasks to maximize performance and reduce latency.
- MLOps Implementation: Build robust, scalable MLOps pipelines to manage the entire machine learning lifecycle, from data ingestion to model deployment.
- R&D Leadership: Conduct research and experiments with emerging AI technologies, including Reinforcement Learning from Human Feedback (RLHF) and multimodal models.
- Code Review & Mentorship: Lead code reviews and mentor junior engineers, ensuring code quality and adherence to best practices.
- Performance Tuning: Analyze system bottlenecks and implement optimizations to ensure high availability and low latency for AI inference services.
Qualifications
- Bachelor’s or Master’s degree in Computer Science, Mathematics, or a related field.
- 5+ years of professional experience in software engineering, with at least 3 years focused on Machine Learning or AI.
- Deep expertise in Python, PyTorch, and TensorFlow.
- Extensive experience working with state-of-the-art LLMs (e.g., GPT-4, Claude, Llama).
- Strong understanding of vector databases (e.g., Pinecone, Milvus) and RAG architectures.
- Familiarity with cloud platforms (AWS/GCP/Azure) and containerization technologies (Docker/Kubernetes).
- Experience with prompt engineering and model evaluation frameworks.
- Excellent problem-solving skills and the ability to work in a fast-paced, agile environment.