Job Title: Systems Engineer III
Contract Duration: 9 months, possible extension
Location: Fremont, CA (Remote)
Summary:
- The main function of a systems engineer is to apply the principles of computer science and mathematical analysis to the design, development, testing, and evaluation of the software and systems that make computers work.
- A typical systems engineer analyzes user needs, and then designs, tests, and develops software to meet those needs.
Responsibilities:
- Develop back-end services, workflows, and automation that handle fleet management and the full server lifecycle (network boot, firmware updates, OS provisioning, failure detection, erase, and decommissioning)
- Develop out-of-band server management tooling in a multi-vendor environment, including state management and telemetry collection
- Write and review code, test changes on hardware, automate testing
- Troubleshoot provisioning, firmware update and network boot issues end to end
Skills:
- 3+ years of experience in the following areas:
- Experience in TCP/IP network fundamentals
- Experience with Linux systems and server systems management
- Experience in troubleshooting the server network boot process from both the network and the server point of view
- Experience in coding and scripting automation (Python, Go, Rust, Bash, Ruby)
- Experience in automating management of large fleets of server hardware (Redfish, IPMI)
- Proven experience troubleshooting and analyzing server/systems architecture and components
- Experience with Kubernetes, Docker and/or cloud deployment technologies.
Education/Experience:
- Bachelor’s degree in computer science, software engineering, or a relevant field required. Experience can be used in lieu of education.
- Linux/TCP/IP certification preferred.
Must Have Skills:
- TCP/IP Network Fundamentals – Deep understanding of networking concepts and experience troubleshooting network boot processes.
- Linux Systems & Server Management – Experience managing Linux servers, including provisioning, firmware updates, and lifecycle management.
- Coding & Scripting for Automation – Proficiency in automation using languages such as Python, Go, Rust, Bash, or Ruby.
Nice to Have Skills:
- Experience with Server Management Protocols – Familiarity with Redfish, IPMI for automating large server fleets.
- Cloud & Container Technologies – Experience with Kubernetes, Docker, and cloud deployment.
- Troubleshooting Analytics – Proven ability to analyze server/systems architecture and components.
- Linux/TCP/IP Certification – Certifications are preferred but not strictly required.
