irradiated@radiation.partyMB to TechNews@radiation.party · 2 years ago[HN] Think Before You Speak: Training Language Models with Pause Tokensarxiv.orgexternal-linkmessage-square0fedilinkarrow-up12file-textcross-posted to: machinelearning@kbin.socialhackernews@lemmy.smeargle.fanshackernews@derp.foo
arrow-up12external-link[HN] Think Before You Speak: Training Language Models with Pause Tokensarxiv.orgirradiated@radiation.partyMB to TechNews@radiation.party · 2 years agomessage-square0fedilinkfile-textcross-posted to: machinelearning@kbin.socialhackernews@lemmy.smeargle.fanshackernews@derp.foo