Senior Software Engineer, Cloud Automation
Job Description
We are now looking for a Software Engineer for Cloud Automation
NVIDIA Cloud platform engineering is looking for outstanding DevOps infrastructure automation engineers to deliver our Nvidia GPU Cloud services. Be part of the team that delivers 250K+ automation runs per week 24x7 around the clock with thousands of automation projects. This is an unusual opportunity to work on multi-OSes, multi-cloud platforms and virtualization technologies at the same time.
We are looking for dedicated engineers who aren’t afraid of taking up new DevOps and cloud technologies, processes, and tools to make constant improvement on the automation infrastructure. You will collaborate closely with application developers and QA to build the world-class automation infrastructure for deployment, testing, monitoring and CI/CD operations.
What you'll be doing:
Work with Configuration Management tools and Workflow management tool like Ansible and Stackstorm to manage and deploy COLO multi-platform server clusters in Nvidia Data Centers around the world.
You will design and develop different infrastructures, tools, and automation scripts to improve automation support for varies activities such as image management, switch management, deployment, data analytics, automated testing, logging, monitors and alerts for different micro-services.
Work with data centers infrastructure tools like netbox, foreman and Mellanox switches to manage DCs at scale.
Work on the latest infrastructure management like Kubernetes and docker for fast and consistent delivery.
You will be the owner of infrastructure for micro-services and provide operation support to application teams such as automation service, infrastructure and security improvement, and live service troubleshooting.
What we need to see:
BS in Computer Science/Engineering/Math/Physics
3+ years of proven experience
Excellent scripting: Python, bash, Groovy, GOLANG
Outstanding debugging skills
Cloud experience with AWS Compute, Containers, and Networking services are preferable.
CI/CD experience with Jenkins and Jenkins pipeline
Experience with Configuration Management such as Ansible is a big plus.
Experience with Packer, Terraform and StackStorm is a plus.
Experience with Kubernetes, docker and Helm is a huge plus
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most brilliant and talented people in the world working for us. If you're creative and autonomous, we want to hear from you!
We are an AA/EEO/Veterans/Disabled employer