Cloud Infrastructure Engineer

7 days ago


Singapore Assurity Trusted Solutions Full time

Assurity Trusted Solutions (ATS) is a wholly owned subsidiary of the Government Technology Agency (GovTech). As a Trusted Partner over the last decade, ATS offers a comprehensive suite of products and services ranging from infrastructure and operational services, authentication services, governance and assurance services as well as managed processes. In a dynamic digital and cyber landscape, where trust & collaboration are key, ATS continues to drive mutually beneficial business outcomes through collaboration with GovTech, government agencies and commercial partners to mitigate cyber risks and bolster security postures.
We are looking for a Cloud Infrastructure Engineer (Kubernetes, Red Hat OpenShift) to join us. This will be a 3-year contract (subject to extension).
You will be working on:
Design, deploy, and optimize Kubernetes clusters using the Nvidia software stack to support large language model applications.
Collaborate with cross-functional teams to integrate Nvidia GPU resources effectively within Kubernetes environments, ensuring optimal performance.
Implement and manage infrastructure as code (IaC) for Nvidia GPU configurations, focusing on scalability and high availability.
Monitor, troubleshoot, and resolve issues related to both Kubernetes clusters and Nvidia GPU resources to maintain a reliable and performant infrastructure.
Stay abreast of industry best practices and emerging technologies related to Kubernetes and the Nvidia GPU ecosystem.
Work closely with development teams to automate deployment processes, leveraging Nvidia GPU capabilities, and streamline workflows.
Implement security best practices to safeguard Kubernetes environments, Nvidia GPU resources, and sensitive data.
Participate in on-call rotation and provide timely response to incidents, minimizing downtime for language model applications.
Contribute to capacity planning and performance tuning activities, considering the demands of large-scale language model applications utilizing Nvidia GPU acceleration.
Document infrastructure configurations, processes, and procedures, facilitating knowledge sharing and team member onboarding.
To succeed in this role, you will ideally have:
Proven experience in designing, implementing, and managing on-premises infrastructure solutions.
Strong knowledge of server virtualisation, storage systems and network infrastructure.
Hands-on experience with cloud-native technologies and deployment strategies.
Proven experience designing, deploying, and managing Kubernetes clusters on platforms such as SUSE Rancher and Red Hat OpenShift.
Strong understanding of containerization concepts such as Docker, orchestration tools like Kubernetes and Nvidia GPU acceleration technologies.
Proficiency in scripting, automation and configuration management using tools such as Ansible, Terraform, or similar.
Familiarity with infrastructure-as-code principles and tools (e.g., Helm, Kubernetes manifests).
Experience with large-scale language model applications, particularly leveraging Nvidia GPU acceleration, is highly desirable.
Solid knowledge of networking concepts, Kubernetes networking models, and integration with Nvidia GPU resources.
Excellent problem-solving and troubleshooting skills, with a proactive approach to system optimization.
Strong communication skills for effective collaboration in a team-oriented, agile environment.
Join us and discover a meaningful and exciting career with Assurity Trusted Solutions.
The remuneration package will be commensurate with your qualifications and experience. Interested applicants, please click "Apply Now".
We thank you for your interest and please note that only shortlisted candidates will be notified.
By submitting your application, you agree that your personal data may be collected, used and disclosed by Assurity Trusted Solutions Pte. Ltd. (ATS), GovTech and their service providers and agents in accordance with ATS’s privacy statement which can be found at: or such other successor site.


  • Cloud Engineer

    7 days ago


    Singapore Cloud Mile Inc. Full time

    Overview: As a Cloud Engineer at CloudMile, you will play an implementation role, configuring or modifying cloud environments in accordance with a reference cloud architecture designed and planned by a Solution Architect. You will work with cross-functional teams in Taiwan, including solution architects, software developers, data scientists, and machine...


  • Singapore DC FRONTIERS PTE. LTD. Full time

    Handshakes is an award-winning DataTech company. We enable our clients to make safer, more informed decisions by delivering meaningful insights, harnessed from reliable data. Through our products, clients gain greater clarity on their businesses, clients, partners, and competitors. We are looking for an experienced **Cloud Infrastructure Engineer** who...


  • Singapore Screening Eagle Technologies AG Full time

    Intro: As a Cloud Infrastructure Engineer, you will run and support the company's software application infrastructure in the cloud, ensuring the optimized use of resources to scale. What will you do: Design, develop, and implement cloud infrastructure as code using Terraform. Manage and optimize cloud resources for cost-efficiency and performance. Secure...


  • Singapore SCREENING EAGLE SINGAPORE PTE. LTD. Full time

    As a Cloud Infrastructure Engineer, you will run and support the company's software application infrastructure in the cloud, ensuring the optimized use of resources to scale. What will you do: Design, develop, and implement cloud infrastructure as code using Terraform. Manage and optimize cloud resources for cost-efficiency and performance. Secure...


  • Singapore ELLIOTT MOSS CONSULTING PTE. LTD. Full time

    **Position Overview**: We are seeking a skilled Cloud Engineer with a strong background in network engineering and cloud computing. In this role, you will design, implement, and manage robust cloud infrastructure solutions, ensuring high performance, scalability, and security for our systems. **Key Responsibilities**: - Design, deploy, and manage AWS cloud...

  • Cloud Engineer

    7 days ago


    Singapore RAPTOR INSIGHTS PTE. LTD. Full time

    **Responsibilities** - To assess the infrastructure of an organisation’s technological systems and make necessary migrations to the cloud; to oversee working standards of cloud-based systems and make improvements as and when necessary. - To ensure all necessary security issues are taken care of, including the need to keep company data protected in the...


  • Singapore Centre for Strategic Infocomm Technologies (CSIT) Full time

    **SINGAPORE, SINGAPORE /** **CLOUD INFRASTRUCTURE AND SERVICES - CLOUD INFRASTRUCTURE & SERVICES /** **FULL-TIME** - You will be part of a dynamic team responsible for maintaining, implementing, designing, exploring and adopting the latest cloud technologies to modernise CSIT’s cloud services and solutions, mainly operating in closed networks. Our ideal...


  • Singapore TALENTSIS PTE. LTD. Full time

    We are looking for a skilled **Cloud Infrastructure Engineer** to manage cloud infrastructure and provide 24x7 support. The role involves troubleshooting and resolving cloud-related issues to ensure continuous uptime. **Key Responsibilities**: - Cloud Management: Maintain and monitor cloud infrastructure (AWS, Azure, Google Cloud) for availability and...


  • Singapore Unison Consulting Pte Ltd Full time

    Role Overview: We are looking for a skilled Infrastructure Cloud Engineer with strong expertise in Terraform and CI/CD practices to join our team. The ideal candidate will have hands-on experience in GitHub Actions, Databricks workspace setup on AWS, and cloud automation. This role involves designing, deploying, and maintaining cloud infrastructure...


  • Singapore Unison Consulting Pte Ltd Full time $80,000 - $120,000 per year

    Role Overview: We are looking for a skilled Infrastructure Cloud Engineer with strong expertise in Terraform and CI/CD practices to join our team. The ideal candidate will have hands-on experience in GitHub Actions, Databricks workspace setup on AWS, and cloud automation. This role involves designing, deploying, and maintaining cloud infrastructure using...