Job Description
Are you ready to define the technological landscape of 2026? Nebula Systems is seeking a visionary Senior AI Infrastructure Engineer to lead the development of next-generation neural architectures. We are not just building software; we are engineering the future of human-machine interaction. Join a team that prioritizes innovation, scalability, and ethical AI development in a fast-paced, remote-first environment.
Why Join Nebula Systems?
- Impact: Work on projects that will define the AI standards for the next decade.
- Autonomy: Enjoy the freedom to experiment with cutting-edge research and implement scalable solutions.
- Growth: Competitive compensation packages and continuous learning opportunities with industry leaders.
If you are passionate about pushing the boundaries of what is possible in machine learning, we want to hear from you.
Responsibilities
- Architect and deploy next-generation Large Language Models (LLMs) with a focus on reduced latency and high throughput.
- Optimize deep learning training pipelines on distributed cloud infrastructure (AWS/GCP) to handle petabyte-scale datasets.
- Collaborate with cross-functional teams of data scientists and product managers to translate research into production-ready features.
- Implement robust MLOps workflows to ensure model reproducibility, monitoring, and automated retraining loops.
- Conduct rigorous code reviews and contribute to the open-source community where applicable.
- Stay ahead of the curve on emerging AI trends, including Generative AI, Reinforcement Learning, and Edge Computing.
Qualifications
- Masterβs or PhD in Computer Science, Mathematics, or a related technical field.
- 7+ years of professional experience in software engineering, with a specific focus on AI/ML infrastructure.
- Expert proficiency in Python, PyTorch, TensorFlow, or JAX.
- Strong understanding of distributed systems, microservices, and containerization (Docker/Kubernetes).
- Experience with cloud platforms (AWS, GCP, or Azure) and serverless architectures.
- Excellent problem-solving skills and the ability to thrive in ambiguous, fast-changing environments.
- Strong communication skills with the ability to explain complex technical concepts to non-technical stakeholders.