Research Team Collects 10,000 Hours of Neuro-Language Data for Thought-to-Text Models
By nee1r
Summary
A research team has collected approximately 10,000 hours of neuro-language data from thousands of individuals over six months, and claims it is the world's largest dataset of its kind. The data supports their work on training thought-to-text models that decode semantic content from noninvasive neural data, with the goal of enabling brain-computer interfaces for communication. The article discusses their methodology and the challenges with existing small datasets, and presents zero-shot examples of their model's capabilities.
Key quotes
Over the last 6 months, we collected ~10k hours of data across thousands of unique individuals.
As far as we know, this is the largest neuro-language dataset in the world.
We train thought-to-text models. That is, we train models to decode semantic content from noninvasive neural data.
Here are some entirely zero-shot examples: