
Google's new Infini-attention technique lets you input infinite text into LLMs

Apr 15, 2024 | Hi-network.com
Artie Beaty

Today's large language models (LLMs) have limits on how much information you can input before they give you a result. Google has unveiled a way to change that: a method that allows LLMs to accept an infinite amount of text. The technique, called Infini-attention, works without demanding additional memory and computational power, making for a more efficient -- and potentially more impactful -- LLM.

"An effective memory system is crucial not just for comprehending long contexts with LLMs, but also for reasoning, planning, continual adaptation for fresh knowledge, and even for learning how to learn," the authors wrote in a research paper accompanying their announcement.

Also: GPT-4 Turbo reclaims 'best AI model' crown from Anthropic's Claude 3

Context windows play a central role in how LLMs operate, and as of this writing, all popular AI models, including OpenAI's GPT-4 and Anthropic's Claude 3, have a finite context window. Claude 3, for example, allows for up to 200,000 tokens (the word and subword chunks a model actually processes, not individual characters) in a single query. GPT-4's context window allows for 128,000 tokens.
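
For a concrete sense of what a token is, here's a minimal sketch using tiktoken, OpenAI's open-source tokenizer library; the cl100k_base encoding and the sample string are illustrative choices, not anything from the paper:

    import tiktoken

    # cl100k_base is the encoding tiktoken ships for GPT-4-era models.
    enc = tiktoken.get_encoding("cl100k_base")

    text = "Infini-attention lets LLMs accept arbitrarily long inputs."
    tokens = enc.encode(text)

    # A token is typically a word or word fragment, not a single character.
    print(f"{len(text)} characters -> {len(tokens)} tokens")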

The context window matters a lot for LLMs. The more tokens allowable in the context window, the more data users can input to generate their desired result. LLM creators therefore try to increase the number of tokens with each new iteration to make their models more effective at learning, understanding, and delivering results.

In order to do so, however, tech companies need to account for growing memory and computing requirements. With every doubling of an LLM's context window, the memory and computational requirements increase by a factor of four, the Google researchers wrote. Each such increase is not just resource-intensive, but exceedingly expensive.
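
A back-of-the-envelope calculation shows why. Standard attention builds a score matrix of size context × context, so the memory for that matrix alone grows quadratically; the figures below assume fp16 scores (2 bytes each) and a single attention head, both illustrative simplifications:

    # Rough memory for the attention score matrix alone, assuming
    # fp16 (2 bytes per score) and a single head -- illustrative only.
    for ctx in (8_000, 16_000, 32_000, 64_000):
        gb = ctx * ctx * 2 / 1e9
        print(f"{ctx:>6} tokens -> {gb:5.1f} GB of scores")
    # Each doubling of the context window quadruples the requirement.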

Also: Adobe Premiere Pro's two new AI tools blew my mind. Watch them in action for yourself

Google's Infini-attention addresses this problem within a model's existing memory and computational budget. When the researchers fed a context window more data than the models they tested could normally hold, they transferred all of the data up to the limit into what's called "compressive memory" and removed it from active memory, which was then freed up for the additional context. Once all of the data was entered, the model paired the compressive memory with the input in its active memory to deliver a response. This technique enables "a natural extension of existing LLMs to infinitely long contexts via continual pre-training and finetuning," the researchers wrote.
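
As a rough illustration, here's a minimal NumPy sketch of how such a layer might process one segment of a long input: retrieve long-range context from a fixed-size compressive memory, run ordinary attention over the active segment, blend the two, and fold the segment into memory before moving on. It loosely follows the linear-attention-style update described in the paper; every name is illustrative, and a real implementation would use learned gates and multi-head projections:

    import numpy as np

    def feature_map(x):
        # Non-negative feature map sigma(x) = ELU(x) + 1, common in linear attention.
        return np.where(x > 0, x + 1.0, np.exp(x))

    def infini_attention_step(Q, K, V, M, z, beta=0.5):
        """Process one segment. Q, K, V: (seg_len, d). M: (d, d) compressive
        memory. z: (d,) running normalizer. beta: blend weight (a learned
        gate in the paper; fixed here)."""
        sQ, sK = feature_map(Q), feature_map(K)

        # Retrieve long-range context from memory: sigma(Q) M / (sigma(Q) z).
        A_mem = (sQ @ M) / (sQ @ z + 1e-8)[:, None]

        # Ordinary softmax attention within the active segment.
        scores = Q @ K.T / np.sqrt(Q.shape[1])
        w = np.exp(scores - scores.max(axis=1, keepdims=True))
        w /= w.sum(axis=1, keepdims=True)
        A_local = w @ V

        # Fold this segment into memory; its tokens can then leave active memory.
        M = M + sK.T @ V
        z = z + sK.sum(axis=0)

        return beta * A_mem + (1 - beta) * A_local, M, z

    # Stream an arbitrarily long input through fixed-size state.
    d, seg_len = 64, 128
    rng = np.random.default_rng(0)
    M, z = np.zeros((d, d)), np.zeros(d)
    for _ in range(10):  # ten segments here; could be any number
        X = rng.standard_normal((seg_len, d))
        out, M, z = infini_attention_step(X, X, X, M, z)

The key property is that M and z never grow: however many segments stream through, the model carries the same fixed-size state, which is what lets the context extend indefinitely without extra memory.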

Armed with the ability to put as much context into their models as they wished, the researchers compared their Infini-attention technique against existing LLMs and found their approach superior. "Our approach can naturally scale to a million length regime of input sequences, while outperforming the baselines on long-context language modeling benchmark and book summarization tasks," the researchers wrote.

The researchers didn't share their data, however, so the claimed gains haven't been independently verified. It stands to reason, though, that if they can eliminate context window limitations, models equipped with this technique should outperform those with limits in place.

Google's technique could pave the way for dramatic improvements in LLM performance, allowing companies to create new applications, generate additional insights, and more. For now, though, Infini-attention is purely research. It's unclear whether the technique will make its way into broadly available LLMs.

