Home Job Details
N
Information Technology 🏒 Full Time ⭐️ Verified

Senior AI Infrastructure Engineer (2026 Vision)

Nebula Systems
San Francisco
Estimated Salary
USD 180.000 – USD 250.000
Live Update
11 Mei 2026
Deadline
11 Mei 2027

Job Description

Are you ready to define the technological landscape of 2026? Nebula Systems is seeking a visionary Senior AI Infrastructure Engineer to lead the development of next-generation neural architectures. We are not just building software; we are engineering the future of human-machine interaction. Join a team that prioritizes innovation, scalability, and ethical AI development in a fast-paced, remote-first environment.

Why Join Nebula Systems?

  • Impact: Work on projects that will define the AI standards for the next decade.
  • Autonomy: Enjoy the freedom to experiment with cutting-edge research and implement scalable solutions.
  • Growth: Competitive compensation packages and continuous learning opportunities with industry leaders.

If you are passionate about pushing the boundaries of what is possible in machine learning, we want to hear from you.

Responsibilities

  • Architect and deploy next-generation Large Language Models (LLMs) with a focus on reduced latency and high throughput.
  • Optimize deep learning training pipelines on distributed cloud infrastructure (AWS/GCP) to handle petabyte-scale datasets.
  • Collaborate with cross-functional teams of data scientists and product managers to translate research into production-ready features.
  • Implement robust MLOps workflows to ensure model reproducibility, monitoring, and automated retraining loops.
  • Conduct rigorous code reviews and contribute to the open-source community where applicable.
  • Stay ahead of the curve on emerging AI trends, including Generative AI, Reinforcement Learning, and Edge Computing.

Qualifications

  • Master’s or PhD in Computer Science, Mathematics, or a related technical field.
  • 7+ years of professional experience in software engineering, with a specific focus on AI/ML infrastructure.
  • Expert proficiency in Python, PyTorch, TensorFlow, or JAX.
  • Strong understanding of distributed systems, microservices, and containerization (Docker/Kubernetes).
  • Experience with cloud platforms (AWS, GCP, or Azure) and serverless architectures.
  • Excellent problem-solving skills and the ability to thrive in ambiguous, fast-changing environments.
  • Strong communication skills with the ability to explain complex technical concepts to non-technical stakeholders.

Required Skills

Python PyTorch TensorFlow Machine Learning Deep Learning Distributed Systems MLOps AWS Kubernetes GCP NLP

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.

Apply Now

Related Jobs

Similar job recommendations for you

View All