Senior System Software Engineer - TAO ML Data Platforms
Company: Aitopics
Location: Santa Clara
Posted on: October 31, 2024
Job Description:
Senior System Software Engineer - TAO ML Data PlatformsNVIDIA is
hiring a Senior Systems Software Engineer for the ML Data Platform
to join the TAO Toolkit Team. Our team builds frameworks, services,
algorithms, and tools that power the largest NVIDIA Multi-Modal
Foundation Models and their customization. We are seeking someone
who can help develop novel, scalable, and automated pipelines to
make sense of petabytes of unstructured data and collaborate with
multiple deep-learning architects and engineers to enable the
development of pioneering AI models.What you'll be doing:
- Design, develop, and support a platform to access large
datasets, integrating data from various sources.
- Build scalable and robust ETL pipelines to deliver high-quality
datasets.
- Implement lake house structures and model data for optimized
reporting to analyze the datasets.
- Develop and maintain cloud services for data management and
machine learning workflows in a production environment.
- Help to find the right data for a Multi-Modal model with
scalable query-retrieval systems.
- Work with internal teams to define requirements, enhance
products, and automate workflows.What we need to see:
- Bachelor's degree (or equivalent experience) in Computer
Engineering, Computer Science, Electrical Engineering, Robotics, or
related field.
- 5+ years of ML / related software engineering experience with
strong architecture and design skills.
- Proficient in Python and familiar with knowledge of the latest
tools.
- Experience with SQL databases and cloud infrastructure (AWS,
GCP, Kubernetes).
- Experience in high-performance computing environments and
workflow automation frameworks (e.g., Airflow).
- Knowledge of PyTorch, distributed machine learning, and
distributed file systems.
- 3+ years leading complex sometimes ambiguous projects,
particularly in high-throughput services at supercomputing
scale.
- Proficient in running applications on cloud platforms using
Kubernetes and Docker, and ML frameworks like Pytorch.Ways to stand
out from the crowd:
- Familiar with GPU programming concepts, and writing custom CUDA
kernels.
- Experience with Dagster, Terraform.
- Proficient in building systems and familiar with deep learning
architectures and tools like Nvidia TensorRT-LLM, Multimodal-LLM,
and Triton Server.With a competitive salary package and benefits,
NVIDIA is widely considered to be one of the technology world's
most desirable employers. Are you a creative and autonomous Senior
Software Engineer, who loves challenges? Do you have a genuine
passion for advancing the state of Data Science across a variety of
industries? If so, we want to hear from you.The base salary range
is 148,000 USD - 276,000 USD. Your base salary will be determined
based on your location, experience, and the pay of employees in
similar positions.NVIDIA is committed to fostering a diverse work
environment and proud to be an equal opportunity employer. As we
highly value diversity in our current and future employees, we do
not discriminate (including in our hiring and promotion practices)
on the basis of race, religion, color, national origin, gender,
gender expression, sexual orientation, age, marital status, veteran
status, disability status or any other characteristic protected by
law.
#J-18808-Ljbffr
Keywords: Aitopics, Hayward , Senior System Software Engineer - TAO ML Data Platforms, IT / Software / Systems , Santa Clara, California
Didn't find what you're looking for? Search again!
Loading more jobs...