NVIDIA is looking for a Data Center Network Deployment Engineer to join the Networking clusters solutions HPC/AI Infrastructure team. We build supercomputers and AI clusters based on groundbreaking technologies. We are looking for a network/system engineer to play a key role working with the most exciting computing hardware and software and to contribute to the latest breakthroughs in artificial intelligence and GPU computing.
You will work with the latest accelerated computing and deep learning software and hardware platforms, and with many scientific researchers, developers, and customers to craft improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialists to architect, develop and bring up large-scale performance platforms. Does this sound like you? If so, we would love to hear from you!
What you'll be doing:
Deploy, manage and maintain large-scale AI data centers, including the control, network and storage stack
Work with multiple software and hardware teams to optimize the clusters' networking health and performance
Develop and implement automation scripts for network, compute and storage operations and deployments
Support Research & Development activities and engage in POCs/POVs for future improvements
What we need to see:
B.Sc. in Engineering or CCNP certification
3+ years of experience with networking fundamentals, configuring Ethernet switches, the TCP/IP stack, and data center architecture.
Excellent knowledge of Windows and Linux (Red Hat/CentOS and Ubuntu) networking (sockets, firewalls, iptables, Wireshark, etc.) and internals, ACLs, OS-level security protection, and common protocols such as TCP, DHCP and DNS.
Proactive individual with the ability to work independently, prioritizing tasks to optimize technology and enhance customer experience.
Ability to provide ad hoc knowledge transfers, develop handover materials, and offer deployment support for engagements.
Ways to stand out from the crowd:
Combination of interpersonal skills and technical competence
Knowledge of HPC and AI solution technologies, from CPUs and GPUs to high-speed interconnects and supporting software
Experience with multiple storage solutions such as Lustre, GPFS, and newer and emerging storage technologies.
Automation tooling background (Ansible, Salt, Puppet, etc.).
NVIDIA is widely considered to be one of the technology world’s most desirable employers! We have some of the most forward-thinking and hardworking individuals in the world working for us. If you're creative and autonomous, we want to hear from you!
#IL-Hybrid