All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

Mercury: New Generation of Large Language Models for Coding Applications

By

PaulHoule

10mo ago· 2 min readenNews

Summary

Mercury is a new generation of large language models based on diffusion, designed for coding applications. It includes Mercury Coder Mini and Small, setting a new state-of-the-art in speed and quality. Independent evaluations show significant performance improvements over existing models.

Key quotes

· 3 pulled
Mercury Coder Mini and Mercury Coder Small achieve state-of-the-art throughputs of 1109 tokens/sec and 737 tokens/sec, respectively, on NVIDIA H100 GPUs.
We also release a public API at https://platform.inceptionlabs.ai/ and free playground at https://chat.inceptionlabs.ai
These models set a new state-of-the-art on the speed-quality frontier.
Snippet from the RSS feed
We present Mercury, a new generation of commercial-scale large language models (LLMs) based on diffusion. These models are parameterized via the Transformer architecture and trained to predict multiple tokens in parallel. In this report, we detail Mercury

You might also wanna read