All Topics
All Topics
Technology
Technology
AI
AI
Business
Business
Entertainment
Entertainment
News
News
Programming
Programming
Security
Security
Science
Science
Design
Design
Environment
Environment
Finance
Finance
Crypto
Crypto
Politics
Politics
Sports
Sports
Education
Education
Gaming
Gaming
Art
Art
Music
Music
Health
Health
Books
Books
Food
Food
Travel
Travel
Personal
Personal
Bluesky
Twitter

AI Search - New Workers AI models for text generation and embedding in AI Search

2mo ago

Source

CloudflareAI Search - New Workers AI models for text generation and embedding in AI Searchcloudflare.com
Snippet from the RSS feed
AI Search now supports four additional Workers AI models across text generation and embedding. Text generation Model Context window (tokens) @cf/zai-org/glm-4.7-flash 131,072 @cf/qwen/qwen3-30b-a3b-fp8 32,000 GLM-4.7-Flash is a lightweight model from Zhipu AI with a 131,072 token context window, suitable for long-document summarization and retrieval tasks. Qwen3-30B-A3B is a mixture-of-experts model from Alibaba that activates only 3 billion parameters per forward pass, keeping inference fast while maintaining strong response quality. Embedding Model Vector dims Input tokens Metric @cf/qwen/qwen3-embedding-0.6b 1,024 4,096 cosine @cf/google/embeddinggemma-300m 768 512 cosine Qwen3-Embedding-0.6B supports up to 4,096 input tokens, making it a good fit for indexing longer text chunks. EmbeddingGemma-300M from Google produces 768-dimension vectors and is optimized for low-latency embedding workloads. All four models are available without additional provider keys since they run on Workers AI. Select them when creating or updating an AI Search instance in the dashboard or through the API. For the full list of supported models, refer to Supported models .

You might also wanna read

Comments

Sign in to join the conversation.

No comments yet. Be the first.