Meta AI Releases Brain2Qwerty v2: MEG Transcription Typed Sentences for Brain-to-Text Conversion with 61% Word Accuracy

0 10 5 minutes read

Meta AI Releases Brain2Qwerty v2: MEG Transcription Typed Sentences for Brain-to-Text Conversion with 61% Word Accuracy

Meta AI recently launched Brain2Qwerty v2. Determining natural sentences from non-distracting mental recordings in real time. The program reads magnetoencephalography (MEG) signals while a person types. It rebuilds what they typed, untethered and unedited. This is a sequel to Brain2Qwerty v1, released in February 2025. Meta also releases the full training code for both versions. The pipeline includes a convolutional encoder, a transformer, and a character-level language model.

The TL;DR

Brain2Qwerty v2 determines typed sentences from MEG signals non-invasively, without implants or surgery.
It achieves 61% average word accuracy (39% WER), up from 8% for previous non-invasive methods.
The best participant achieved 78% word accuracy, with more than half of the sentences with one word error or less.
The pipeline pairs a convolutional encoder, transformer, and character-level language model, along with fine-tuned LLMs.
Accuracy scales log-linearly with data; training code for v1 and v2 is released under CC BY-NC 4.0.

What is Brain2Qwerty v2?

Brain2Qwerty v2 is a brain-to-text decoder. It maps immature brain activity to characters, then words and sentences.

Meta trained it on nearly 22,000 sentences from nine volunteer participants. Each participant was recorded for 10 hours while writing continuously.

The recording is from the MEG device. MEG measures the magnetic fields produced by neuronal activity, sampled at high temporal resolution.

The model uses letter, word and sentence level. That layered design allows it to correct local errors using a wider context.

Importantly, this is research, not a product. The decoder is not a consumer device, and has been tested on a small group of volunteers.

The data was collected with the BCBL of Spain (Basque Center on Cognition, Brain and Language). It belongs to that research center.

How the Decoding Pipeline Works

Previous non-invasive systems relied on manual taps to detect neural events. Brain2Qwerty v2 replaces that step with deep end-to-end learning.

According to the Meta repository, the model consists of three parts: a transcoding encoder, a converter, and a character-level language model.

A convolutional encoder reads the raw MEG signals. It reads features directly from the data instead of using engineered event detectors.

Long range structural transformer models for every signal. The character-level language model then limits the output to plain text.

The Meta research team describes three ways in which AI can make a difference. Each map to a concrete engineering decision they will see.

Deep learning replaces manual event detection.
Large language models have been fine-tuned to extract semantic representations.
AI agents have iteratively refined the decoding pipeline through automated code development. The final training configuration is still handpicked by the devs

Fine-tuning large-scale language models in neural data adds semantic context. That context closes the recording of the noisy brain and the corresponding language output.

Basically, the language model rejects sequences of letters that do not make up real words. It pushes the decoder to a sentence that one can write in a way that makes sense.

Here is a diagram showing the published architecture. It mirrors the defined components and is not a direct training code for Meta.

import torch
import torch.nn as nn

class Brain2QwertySketch(nn.Module):
    """Illustrative: convolutional encoder -> transformer -> char-level head.
    Reflects the components Meta describes, not the official implementation."""
    def __init__(self, n_meg_channels=306, d_model=256, n_chars=40):
        super().__init__()
        # 1) Convolutional encoder over raw MEG channels x time
        self.encoder = nn.Sequential(
            nn.Conv1d(n_meg_channels, d_model, kernel_size=7, padding=3),
            nn.GELU(),
            nn.Conv1d(d_model, d_model, kernel_size=5, padding=2),
            nn.GELU(),
        )
        # 2) Transformer models temporal structure
        layer = nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True)
        self.transformer = nn.TransformerEncoder(layer, num_layers=6)
        # 3) Character-level head; a language model refines this downstream
        self.char_head = nn.Linear(d_model, n_chars)

    def forward(self, meg):           # meg: (batch, channels, time)
        x = self.encoder(meg)         # (batch, d_model, time)
        x = x.transpose(1, 2)         # (batch, time, d_model)
        x = self.transformer(x)       # contextualized features
        return self.char_head(x)      # (batch, time, n_chars)

To work with the original Meta code, match the repository and test both versions:

git clone 
# brain2qwerty_v1/ and brain2qwerty_v2/ hold the training code

Accuracy Numbers

Brain2Qwerty v2 achieves a 61% word accuracy rating. That equates to a word error rate (WER) of 39%.

For the best participant, the model achieves a word accuracy of 78%. For that participant, more than half of the sentences had one or fewer word errors.

The previous premise is important here. Meta reports that some non-invasive methods only achieved 8% word accuracy.

Accuracy also improves logging in line with data volume. Additional recording hours are predicted to increase the accuracy in the reported range.

That scaling behavior is an important claim for builders. It suggests that the gap in surgical coverage can be narrowed by data alone.

Metric	Brain2Qwerty v2	Previous non-invasive methods
Average word accuracy	61%	8%
Average word error rate (WER)	39%	–
Excellent participant name accuracy	78%	–
Recording method	MEG, it doesn't attack	It is invulnerable
Measuring behavior	Sign in with data	–

These numbers are from volunteers in a controlled condition. There are no clinical implications in brain-injured patients.

v1 vs v2: What's Changed

Brain2Qwerty v1 and v2 report different metrics, so compare carefully. v1 is rated at the character level, v2 at the word level.

A feature	Brain2Qwerty v1 (Feb 2025)	Brain2Qwerty v2 (Jun 2026)
Devices	MEG and EEG	MEG
Participants	35 healthy volunteers	9 volunteers
The data	Typed sentences	22,000 sentences, 10 hours each
Result reported	Up to 80% of grains (MEG)	61% average word accuracy
The level of representation	Character level	Letter, word and sentence level
Real-time decoding	It is not emphasized	Real-time sentence recording

v1 also showed the MEG recording was at least twice as good as the EEG system. EEG signals are noisier, which limits accuracy.

Use Cases with examples

The main motivation is to restore communication. Millions of people have brain lesions that prevent them from speaking or moving.
Invasive methods such as stereotactic electroencephalography and electrocorticography are already feeding the neuroprosthesis into the AI code. But they require neurosurgery and are difficult to measure.
A non-invasive decoder can extend access. The patient may be able to type sentences without the implant, using only external recordings.
For researchers, the extracted code supports replicable neuroscience. The lab can also train the pipeline on its MEG dataset.
For AI developers, the project is a biosignal decoding template. The convolutional-encoder-plus-transformer pattern applies to other biosignal functions.
For data scientists, the log-linear regression is a plotting tool. It frames how much new recording data can improve accuracy.

Interactive Descriptor

=1000?(Statistics.round(sent/100)/10)+'k':sent)+' sentence'; } dataEl.addEventListener('input',function(){dataVal.textContent=dataLabel(+dataEl.value);}); // —- accuracy model (maps device + data hours -> target name accuracy) —- // MEG v2 at 10h ~ 0.61 avg; log-linear in the data; EEG is very low (means before ~ 0.08). function targetAccuracy(){ var h=+dataEl.value, frac=Math.log(h)/Math.log(10); // 0..1 between 1..10h if(device==='meg'){ var lo=0.18, hi=0.61; // 1h floor -> 10h reported rate return lo+(hi-lo)*frac; }else{ var elo=0.05, ehi=0.22; // EEG stays low; ~prior non-invasive band returns elo+(ehi-elo)*frac; } } // —- helpers: corrupt characters, then “LM snap” in names —- function corruptChars(s,charErr){ var keys=”abcdefghijklmnopqrstuvwxyz”; return s.split('').map(function(ch){ if(ch===' ')return' '; if(Math.random()<charerr class=""c-bad"">'+keys[Math.floor(Math.random()*26)]+'';} return ch; }).join(''); } // word-level editing distance (Levenshtein on tokens) function wWords(ref,hyp){ var a=ref.split(/s+/),b=hyp.split(/s+/),n=a.length,m=b.length; iva d=[];for(var i=0;i<=n;i++){d[i]=[i];} for (var j=0;j<=m;j++){d[0][j]=j;} of (i=1;i<=n;i++)of (j=1;j<=m;j++){ var c=a[i-1]===b[j-1]?0:1; d[i][j]=Izibalo.min(d[i-1][j]+1,d[i][j-1]+1,d[i-1][j-1]+c); } buyisela {dist:d[n][m],n:n}; } // khiqiza i-hypothesis ehlukanisiwe ngegama elinikeziwe lokunemba ngokushintshanisa amagama athile var SWAPS={the:'they',meeting:'meaning',sheduled:'schedule',kusasa:'kusasa', ntambama:'aftermoon',sicela:'please',letha:'letha,'document', Ihhovisi:'amahhovisi',ingaba:'ingahle', njengokuthi:'ithandiwe',ingilazi:'ikilasi',elibandayo:'igolide',amanzi:'weta', isitimela:'imvula',ihamba:'ihambile',isiteshi:'iziteshi',isishiyagalolunye:'eyami',ukufunda:'okuhamba phambili', ibhuku:'bheka',ingadi:'qinisa',yena:'bona',ithi:'in',yami:'by',kuya:'nakhona',for:'far',of:'on',at:'as',a:'as',i:'a'}; // wonakalisa amagama angu-k ashintshashintshayo (k eqhutshwa ukunemba okuqondiwe) kumsebenzi wamamethrikhi azinzile we-decodeWords(s,wordAcc){ var words=s.split(' '); va idx=[];words.forEach(umsebenzi(w,i){if(SWAPS[w])idx.push(i);}); var nBad=Math.round((1-wordAcc)*words.length); nBad=Math.max(0,Math.min(nBad,idx.length)); // shova indices eshintshwayo, thatha i-nBad yokuqala yokuthi(var i=idx.length-1;i>0;i–){var j=Math.floor(Math.random()*(i+1));var t=idx[i]idx;[i]=idx[j]idx;[j]=t;} var bad={};idx.slice(0,nBad).forEach(function(i){bad[i]=1;}); return names.map(function(w,i){ if(bad[i]) return { w:SWAPS[w],correct:false}; return {w:w,ok:true}; }); } // —- scope animation —- function drawScope(progress, sound){ var W=canvas.width,H=canvas.height;ctx.clearRect(0,0,W,H); ctx.lineWidth=1.6; for(var lane=0;lane<3;lane++){ctx.beginPath(); var base=H*(0.32+lane*0.2), amp=12-lane*2, col=['#5fd0de','#8fe0ea','#3fb9c9'][lane]; ctx.strokeStyle=col;ctx.globalAlpha=0.5+lane*0.12; for (var x=0;x<=W;x+=4){ var p=x/W, on=p<progress var="" n="on?(Math.sin((x*0.05)+(t0*0.006)+lane)*amp" y="base+n;" if="" ctx.lineto="" ctx.stroke="" ctx.globalalpha="1;" sweep="" line="" sx="progress*W;" ctx.strokestyle="rgba(118,185,0,.8)" ctx.beginpath="" function="" setstage="" stages.foreach="" s.classlist.toggle="" animatenumber="" start="performance.now();" step="" k="Math.min(1,(now-start)/ms),e=1-Math.pow(1-k,3);" el.textcontent="Math.round(from+(to-from)*e)+suffix;" requestanimationframe="" run="" sentence="sentEl.value.trim();" targetel.textcontent="sentence;" noisyel.innerhtml="" runbtn.classlist.add="" slab.textcontent="acquiring · 1000 Hz" accel.textcontent="—" wordacc="targetAccuracy();" charerr="Math.min(0.7,(1-wordAcc)*0.85+" dur="2600," stagems="dur/4;" t0="0;" starttime="performance.now();" loop="" el="now-startTime," p="Math.min(1,el/dur);" stageidx="Math.min(3,Math.floor(p*4));" drawscope="">=1 && !noisyEl.dataset.set){ // display noisy characters in embedding section noisyEl.innerHTML=corruptChars(sentence,charErr); noisyEl.dataset.set=”1″; slab.textContent=”fix letters”; } if(p<1){animRAF=requestAnimationFrame(loop);}else{finish(wordAcc,sentence);} } noisyEl.dataset.set=""; animRAF=requestAnimationFrame(loop); function finish(wordAcc,sentence){ stages.forEach(functions){s.classList.remove('active');s.classList.add('finished'); }); slab.textContent="sentence found"; // creating LM corrected output var dec=decodeWords(sentence,wordAcc); finalEl.innerHTML=dec.map(function(o){ return ''+o.w+''; }).join(' '); var hyp=dec.map(function(o){return ow;}).join(' '); var w=werWords(sentence,hyp); // displayed accuracy tracking model target (stable), with light jitter var acc=Math.round(wordAcc*100 + (Math.random()*4-2)); acc=Math.max(0,Math.min(100,acc)); var werPct=100-acc; animateNumber(accEl,0,acc,'%',900,function(){ accBar.style.width=acc+'%'; }); animateNumber(werEl,0,werPct,'%',900,function(){werBar.style.width=werPct+'%';}); // element note var note=document.getElementById('b2qNote'); if(device==='eeg'){ note.innerHTML='EEG technique: electric field recording is very noisy, so accuracy remains low — around the ~8% band reported by previous non-invasive methods.'; }else if(+dataEl.value<10){note.innerHTML='Measuring data: at '+dataEl.value+' the model follows its ceiling. Accuracy increases log-linearly to a reported average of 61% at 10 h.'; }else{note.innerHTML='MEG v2 · full data: this remains close to what has been reported 61% average word accuracy. The best Meta participant has been reached 78%splitting more than half the sentences with one or less word errors.'; } busy=false;runBtn.classList.remove('busy');runBtn.disabled=false;runTxt.textContent=”Decode again”; postHeight(); } } runBtn.addEventListener('click',run); // idle scope blink (function void(){ if(!busy){ var W=canvas.width,H=canvas.height;ctx.clearRect(0,0,W,H); ctx.strokeStyle=”rgba(95,208,222,.25)”;ctx(thx.4.linevard);ctx.4 x=0;x<=W;x+=6) y=H*0.5+Math.sin(x*0.03+performance.now()*0.0014)*5; if(x===0)ctx.moveTo(x,y);else ctx.lineTo(x,y);}ctx.idle);(}} setTimeout(postHeight,400); })(); </progress></charerr>

Source link

nimda 3 weeks ago

0 10 5 minutes read