Hardware Integration & Standardized Delivery: Responsible for the rack installation and physical connection of GPU servers, liquid-cooled cabinets, and network equipment. Strictly adhere to hardware installation standards to ensure operational safety and compliance.
Hardware Diagnostics & Maintenance: Troubleshoot and maintain core server components (motherboard, CPU, GPU, memory, HDD, PSU). Monitor hardware health status to promptly identify and mitigate potential risks.
Liquid Cooling System Support: Collaborate with Facility Management engineers for liquid cooling system inspections and fault handling. Possess rapid response capabilities for emergencies such as leaks or blockages.
System-Level Troubleshooting: Conduct in-depth fault diagnosis based on Linux/Unix systems, produce professional hardware failure analysis reports, and provide improvement recommendations.