All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

Transformer Neural Network Implemented in HyperTalk for Classic Macintosh

By

hammer32

1mo ago· 9 min readenCode

Summary

A complete transformer neural network implemented entirely in HyperTalk, a 1987 scripting language designed for interactive card stacks, running on a Macintosh SE/30. The MacMind project is a 1,216-parameter single-layer single-head transformer that learns the bit-reversal permutation (the opening step of the Fast Fourier Transform) from random examples. It includes all standard transformer components: token embeddings, positional encoding, self-attention with scaled dot-product scores, cross-entropy loss, full backpropagation, and stochastic gradient descent, all written in HyperTalk.

Key quotes

· 3 pulled
MacMind is a 1,216-parameter single-layer single-head transformer that learns the bit-reversal permutation -- the opening step of the Fast Fourier Transform -- from random examples.
Every line of the neural network is written in HyperTalk, a scripting language from 1987 designed for making interactive card stacks, not matrix math.
It has token embeddings, positional encoding, self-attention with scaled dot-product scores, cross-entropy loss, full backpropagation, and stochastic gradient descent.
Snippet from the RSS feed
Single-layer transformer in HyperTalk for the classic Macintosh - SeanFDZ/macmind

You might also wanna read