Staff AI Infrastructure Engineer
15,000-20,000 CNY per month
Full-time
5-10 years
Shenzhen
Job responsibilities
Identify and resolve infrastructure gaps to ensure reliable, efficient, and scalable solutions
Develop advanced AI/ML infrastructure solutions that enhance the efficiency of our skilled ML teams
Design and implement solutions for critical areas of our large-scale GPU clusters, including distributed storage systems, scheduling systems, high-availability capabilities, and core reliability issues
Monitor and optimize the performance of our AI/ML infrastructure, ensuring high availability, scalability, and efficient resource utilization
Develop and deploy automation tools, monitoring solutions, and operational strategies to streamline infrastructure management and reduce manual tasks
Work with various teams, including ML developers, data engineers, and DevOps professionals, to create a cohesive and integrated AI/ML infrastructure ecosystem
Job requirements
Minimum Skill Requirements:
Bachelor's degree in Computer Science, Engineering, or related technical field
5-8+ years of experience in software engineering, with a strong background in developing and managing large-scale distributed systems, ideally within the AI/ML infrastructure domain
Proficiency in a programming language such as Python, Go, or C++, and knowledge of cloud computing platforms such as AWS or Azure
Strong communication and collaboration abilities, effective in working with diverse teams and individuals
Preferred Skill Requirements:
In-depth understanding of AI/ML workflows, including model training, data processing, and inference pipelines
Practical experience with containerization technologies (e.g., Docker, Kubernetes), automation tools (e.g., Ansible, Terraform), and monitoring solutions (e.g., Prometheus, Grafana)
Exceptional problem-solving skills, capable of analyzing complex systems, identifying bottlenecks, and implementing scalable solutions
A passion for continuous learning and staying abreast of new technologies and best practices in the AI/ML infrastructure space