Cloud Technical Solutions Engineer, Compute

5 days ago


Singapore Google Full time

Google will be prioritizing applicants who have a current right to work in Singapore, and do not require Google's sponsorship of a visa.

This role requires you to work in a shift pattern or non-standard work hours as required. This may include weekend work.

Minimum qualifications:
- Bachelor's degree in Science, Technology, Engineering, Mathematics, or equivalent practical experience.
- 6 years of experience writing code in one or more general purpose programming languages (e.g., C++, Java, Python, Go).

Preferred qualifications:
- Experience working directly with AI/ML computing hardware, including GPUs or other accelerators.
- Experience working with large-scale distributed systems, and with common solutions, design patterns, or best practices.
- Experience with containerization and orchestration technologies like Kubernetes or Slurm in an on-prem or cloud environment.
- Experience with ML frameworks (e.g., TensorFlow, PyTorch).
- Experience troubleshooting and advocating for customer needs, and triaging technical issues across the stack (e.g., hardware faults, low-level software, networking, virtualization, kernel drivers, firmware, and performance).
- Understanding of the AI/ML training and inference lifecycle.

About the job

In this role, you will be a part of a global team that provides support to help customers seamlessly make the switch to Google Cloud. When customers cannot resolve issues themselves, your job is to ensure that we have the necessary tools and processes to resolve the issue. You will troubleshoot technical problems for customers with a mix of debugging, networking, system administration, updating documentation, and when needed, coding/scripting. You will make the products easier to adopt and to use by making improvements to the product, tools, processes, and documentation.

The Technical Solutions Engineering team is focused on customer needs, and you will help drive the success and business growth of Google Cloud by understanding and advocating for our customers' issues and tests.

Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.

Responsibilities
- Manage customers’ problems through effective diagnosis, resolution, or implementation of new investigation tools to increase productivity for customer issues on AI/ML infrastructure.
- Develop an in-depth understanding of AI/ML workloads and underlying hardware architectures by troubleshooting, reproducing, and determining the root cause for customer reported issues, and build tools for faster diagnosis.
- Be a consultant and subject matter expert for internal stakeholders in engineering, sales, and customer organizations to resolve complex deployment and operational obstacles in AI infrastructure environments.
- Work closely with multiple product and engineering teams to find ways to improve the product, and interact with our Site Reliability Engineering (SRE) teams to drive high-quality production.

Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google's EEO Policy and EEO is the Law.

If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form.



  • Singapore Assurity Trusted Solutions Full time

    Assurity Trusted Solutions (ATS) is a wholly owned subsidiary of the Government Technology Agency (GovTech). As a trusted partner over the last decade, ATS offers a comprehensive suite of products and services ranging from infrastructure and operational services, authentication services, governance and assurance services, as well as managed processes. In a...


  • Singapore Assurity Trusted Solutions Full time

    Assurity Trusted Solutions (ATS) is a wholly owned subsidiary of the Government Technology Agency (GovTech). As a Trusted Partner over the last decade, ATS offers a comprehensive suite of products and services ranging from infrastructure and operational services, authentication services, governance and assurance services as well as managed processes. In a...


  • Singapore Assurity Trusted Solutions Pte Ltd Full time

    Assurity Trusted Solutions (ATS) is a wholly owned subsidiary of the Government Technology Agency (GovTech). As a trusted partner over the last decade, ATS offers a comprehensive suite of products and services ranging from infrastructure and operational services, authentication services, governance and assurance services, to managed processes. In a dynamic...


  • Singapore Assurity Trusted Solutions Pte Ltd Full time

    4 weeks ago. Assurity Trusted Solutions (ATS) is a wholly owned subsidiary of the Government Technology Agency (GovTech). As a Trusted Partner over the last decade, ATS offers a comprehensive suite of products and services ranging from infrastructure and operational services, authentication services, governance and assurance...


  • Singapore TENCENT CLOUD INTERNATIONAL PTE. LTD. Full time

    **Responsibilities**: - Design and customize cloud solutions for customers, ensuring seamless implementation and project delivery. - Act as the technical bridge between customers and Tencent Cloud’s backend teams, translating customer requirements and industry best practices into actionable solutions. - Monitor and oversee project implementation, ensuring...

  • Solutions Architect

    2 weeks ago


    Singapore Alibaba Cloud Full time

    Overview: Solutions Architect (SA) is a key pre-sales technical position in Alibaba Cloud's international business division. Our SAs are experienced cloud architects with comprehensive IT knowledge and industry insight and are, ultimately, the key to success in implementing projects. Solutions Architects are responsible for developing solutions in Alibaba Cloud...


  • Singapore Razer Inc. Full time

    Technical Lead and Architect, AI Cloud Compute Services. Joining...

  • Cloud Engineer

    1 week ago


    Singapore NANO CLOUD PRIVATE LIMITED Full time

    Job Description & Requirements. Mandatory skills: 1) Cloud Computing (Azure) certifications. 2) Azure Solutions Architect Expert is advantageous. Job Responsibilities: Lead and manage troubleshooting and support across various domain teams. Day 1 implementation and existing Day 2 operations of Cloud Operations. - At least ten (10) years’ experience in...

  • Solution Architect

    1 week ago


    Singapore ARYAN SOLUTIONS PTE. LTD. Full time

    Roles & Responsibilities - Create an architecture blueprint for a loyalty award ecosystem deployed on AWS - Formulate a strategy for the short- and long-term evolution of the systems - Coordinate with business and external parties on the value proposition of the system and participate in deep architectural discussions to ensure solutions are designed for successful...