Don’t expect large language models like the next GPT to be democratized
This article is part of our coverage of the latest in AI research. In early May, Meta released Open Pretrained Transformer (OPT-175B), a large language model (LLM) that can perform various tasks. Large language models have become one of the hottest areas of research in artificial intelligence in the past few years. OPT-175B is the latest entrant in the LLM arms race triggered by OpenAI’s GPT-3, a deep neural network with 175 billion parameters. GPT-3 showed that LLMs can perform many tasks without additional training, given only a few examples or none at all (few- or zero-shot learning). Microsoft later integrated GPT-3 into several of…
This story continues at The Next Web
from The Next Web https://ift.tt/rS4VUPu