NPU Communications Engineer

1 week ago


Singapore Bitdeer Technologies Group Full time $150,000 - $200,000 per year

About Bitdeer:

Bitdeer Technologies Group (Nasdaq: BTDR) is a leader in the blockchain and high-performance computing industry. It is one of the world's largest holders of proprietary hash rate and suppliers of hash rate. Bitdeer is committed to providing comprehensive computing solutions for its customers.

The company was founded by Jihan Wu, an early advocate and pioneer in cryptocurrency who cofounded multiple leading companies serving the blockchain economy. Headquartered in Singapore, Bitdeer has deployed mining datacenters in the United States, Norway, and Bhutan. It offers specialized mining infrastructure, high-quality hash rate sharing products, and reliable hosting services to global users. The company also offers advanced cloud capabilities for customers with high demands for artificial intelligence.

Dedication, authenticity, and trustworthiness are foundational to our mission of becoming the world's most reliable provider of full-spectrum blockchain and high-performance computing solutions. We welcome global talent to join us in shaping the future.

What you will be responsible for:

  • Design and implement foundational collective communication operators (e.g., Send, Receive, Broadcast, Gather, Reduce, All Reduce, All Gather, etc.) tightly coupled with the NPU (Neural Processing Unit) hardware architecture.
  • Optimize communication primitives to exploit hardware features like specialized communication links, on-chip interconnects, and DMA engines, minimizing latency and maximizing bandwidth.
  • Analyze different communication modes (blocking/non-blocking, sync/async, reliable/unreliable) in the context of chip microarchitecture to enhance throughput and reduce stalls.
  • Research and integrate communication algorithms (e.g., Ring, Hierarchical Decomposition) tailored for NPU topology and workload patterns, ensuring scalability across many compute nodes.
  • Ensure software-hardware co-design compatibility, verifying correctness and performance across the chip's instruction set, system software stack, and runtime environment.
  • Perform deep debugging and profiling using hardware-level tools and logs to rapidly identify bottlenecks or correctness issues and drive resolution.
  • Collaborate cross-functionally with chip architects, firmware engineers, and system software teams to deliver optimized communication solutions aligned with the overall AI accelerator roadmap.

How you will stand out:

  • Master's degree or higher in Computer Science, Electrical Engineering, Integrated Circuit Design, or related fields.
  • Proficient in C/C++ and Python programming with strong software engineering skills; experience with assembly or low-level programming for hardware optimization is highly valued.
  • Deep understanding of heterogeneous hardware platforms, especially NPU architecture including compute cores, on-chip memory hierarchies, and communication fabrics.
  • Solid grasp of collective communication principles and algorithms, including the implementation of efficient communication primitives on hardware accelerators.
  • Experience with performance profiling and debugging at hardware-software boundaries, able to use tools like logic analyzers, hardware performance counters, and trace logs.
  • Excellent problem-solving skills and ability to work in a collaborative, cross-disciplinary environment.
  • Bonus skills include knowledge of GPU/TPU/DPU/NPU architectures, CUDA/ROCm programming, RDMA, communication libraries like NCCL, and distributed AI training frameworks.

What you will experience working with us:

  • A culture that values authenticity and diversity of thoughts and backgrounds;
  • An inclusive and respectable environment with open workspaces and exciting start-up spirit;
  • Fast-growing company with the chance to network with industrial pioneers and enthusiasts;
  • Ability to contribute directly and make an impact on the future of the digital asset industry;
  • Involvement in new projects, developing processes/systems;
  • Personal accountability, autonomy, fast growth, and learning opportunities;
  • Attractive welfare benefits and developmental opportunities such as training and mentoring;

Bitdeer is committed to providing equal employment opportunities in accordance with country, state, and local laws. Bitdeer does not discriminate against employees or applicants based on conditions such as race, colour, gender identity and/or expression, sexual orientation, marital and/or parental status, religion, political opinion, nationality, ethnic background or social origin, social status, disability, age, indigenous status, and union.



  • Singapore beBeeCommunication Full time $90,000 - $120,000

    Job DescriptionThe primary responsibility of this position is to design and implement collective communication operators that are tightly coupled with the NPU hardware architecture.Achieve optimal communication performance by optimizing primitives to exploit hardware features such as specialized communication links, on-chip interconnects, and DMA engines,...


  • Singapore Bitdeer Group Full time

    About Bitdeer: Bitdeer Technologies Group (Nasdaq: BTDR) is a leader in the blockchain and high-performance computing industry. It is one of the world’s largest holders of proprietary hash rate and suppliers of hash rate. Bitdeer is committed to providing comprehensive computing solutions for its customers. Headquartered in Singapore, Bitdeer has deployed...


  • Singapore beBeeNpu Full time $90,000 - $120,000

    Job SummaryWe are seeking an experienced Senior NPU Communication Specialist to join our team. The successful candidate will be responsible for designing and implementing foundational collective communication operators tightly coupled with the NPU hardware architecture. This role involves optimizing communication primitives to exploit hardware features,...


  • Singapore beBeeCommunication Full time $120,000 - $180,000

    Role Description:As a key member of our high-performance computing team, you will be responsible for designing and implementing foundational collective communication operators that are tightly coupled with the NPU hardware architecture. Your expertise in optimizing communication primitives to exploit hardware features like specialized communication links,...


  • Singapore Bitdeer Technologies Group Full time $80,000 - $120,000 per year

    About Bitdeer: Bitdeer Technologies Group (Nasdaq: BTDR) is a leader in the blockchain and high-performance computing industry. It is one of the world's largest holders of proprietary hash rate and suppliers of hash rate. Bitdeer is committed to providing comprehensive computing solutions for its customers. The company was founded by Jihan Wu, an early...

  • NPU Design Engineer

    4 weeks ago


    Singapore OMNIVISION TECHNOLOGIES SINGAPORE PTE. LTD. Full time

    ResponsibilitiesDevelop design requirements for an NPU based on system-level specifications.Being part of modelling the performance of the NPU module and its data transaction throughput.Microarchitecture design and RTL coding using Verilog / System Verilog HDL for various sub-blocks of the NPU.Understanding the mathematics of different convolution operators...

  • NPU Design Engineer

    4 weeks ago


    Singapore OVT group Full time

    Responsibilities:Develop design requirements of an NPU given system level specifications.Being part of modelling the performance of the NPU module and its data transaction throughput.Microarchitecture design and RTL coding using Verilog / System Verilog HDL for various sub-blocks of the NPU.Understanding the mathematics of different convolution operators...

  • NPU Design Engineer

    1 week ago


    Singapore OMNIVISION TECHNOLOGIES SINGAPORE PTE. LTD. Full time $125,000 - $175,000 per year

    Responsibilities: Develop design requirements for an NPU based on system-level specifications. Being part of modelling the performance of the NPU module and its data transaction throughput. Microarchitecture design and RTL coding using Verilog / System Verilog HDL for various sub-blocks of the NPU. Understanding the mathematics of different...

  • NPU Design Engineer

    5 days ago


    Singapore OMNIVISION TECHNOLOGIES SINGAPORE PTE. LTD. Full time

    Responsibilities Develop design requirements for an NPU based on system-level specifications. Being part of modelling the performance of the NPU module and its data transaction throughput. Microarchitecture design and RTL coding using Verilog / System Verilog HDL for various sub-blocks of the NPU. Understanding the mathematics of different convolution...

  • NPU Design Engineer

    4 days ago


    Singapore OMNIVISION Full time

    Responsibilities: Develop design requirements of an NPU given system level specifications. Being part of modelling the performance of the NPU module and its data transaction throughput. Microarchitecture design and RTL coding using Verilog / System Verilog HDL for various sub-blocks of the NPU. Understanding the mathematics of different convolution...