BarraCUDA: Open-Source CUDA Compiler Supports AMD, NVIDIA, and Tenstorrent GPUs
By
rurban
Pulled from the oven just right. Trustworthy, fact-dense, deeply satisfying.
Summary
BarraCUDA is an open-source CUDA C++ compiler written from scratch in C99 that can compile .cu files to multiple GPU architectures including AMD RDNA 2/3/4 binaries, NVIDIA PTX, and Tenstorrent Tensix C++. The project was created as a response to NVIDIA's walled garden approach, aiming to provide a dependency-free compiler without using LLVM. The developer describes the challenge as significant but completed the project anyway, with plans to support more architectures in the future.
Key quotes
· 4 pulledAn open-source CUDA C++ compiler written from scratch in C99 that takes .cu files and compiles them to AMD GPU machine code, NVIDIA PTX, and Tenstorrent Tensix C++, with more architectures planned.
No LLVM, no dependencies, and no permission asked.
This is what happens when you look at NVIDIA's walled garden and think 'how hard can it be?' The answer is: quite hard, actually, but I did it anyway.
Takes CUDA C source code, the same .cu files you'd feed to nvcc, and compiles them to AMD RDNA 2/3/4 binaries, NVIDIA PTX, or Tenstorrent Tensix Metaliu
You might also wanna read
Comparing AI Agent Frameworks: Hermes Agent, AutoGPT, OpenAI Agents, and CrewAI in 2026
A practical, engineering-focused comparison of major AI agent frameworks in 2026, including Hermes Agent, AutoGPT, OpenAI Agents, and CrewAI
cstu.io·22h agoRotary GPU: Enabling Large Mixture-of-Experts Models on Consumer Laptop GPUs with Limited Memory
This paper presents Rotary GPU, an exploratory approach to running large Mixture-of-Experts (MoE) language models on consumer-grade hardware
openrsync: A BSD-Licensed Implementation of rsync for OpenBSD and Unix Systems
This article describes openrsync, a BSD (ISC) licensed implementation of the rsync file synchronization tool. It has been merged into OpenBS
openrsync: A BSD-Licensed Implementation of rsync for OpenBSD and Unix Systems
This article describes openrsync, a BSD (ISC) licensed implementation of the rsync file synchronization tool. It has been merged into OpenBS
StepFun Releases Step 3.5 Flash: 196B Sparse MoE Model for OpenClaw Agents
StepFun has released Step 3.5 Flash, a 196B sparse Mixture of Experts (MoE) model that activates only 11B parameters per token for high effi
LibreOffice Board Outlines Web and Mobile Strategy While Maintaining Desktop Focus
The Document Foundation (TDF) held meetings in April and May 2024 to discuss the future strategy for LibreOffice across desktop, mobile, and
blog.documentfoundation.org·4d ago