Much of the future may well be dominated by Artificial Intelligence (AI), and 2022 is likely to be remembered as a seminal year in its progress. Ever since the launch of ChatGPT, OpenAI has drawn praise and criticism in equal measure. Regardless, its success with ChatGPT has prompted big players like Google, Meta, and Anthropic to bring their best AI forward, further intensifying the AI arms race.

A few months after ChatGPT, initially powered by GPT-3.5, was released, OpenAI unveiled its most powerful model yet: GPT-4, which brought considerable advancements in creativity and problem-solving over GPT-3. In May this year, OpenAI introduced GPT-4o, yet another version of GPT-4, which it claims is its fastest, most capable model so far; it also brought the capabilities of GPT-4 to free users.

While OpenAI has been incrementally improving its models, CEO Sam Altman has said on many occasions that all of these are steps towards Artificial General Intelligence (AGI). Meanwhile, there is heightened anticipation among users worldwide about GPT-5. Although Altman has been tight-lipped about its launch, he has hinted many times that the next iteration of the GPT series is in the works. ChatGPT already has audio and vision capabilities, so one can only imagine what GPT-5 will look like when it finally sees daylight. Here are a few things that users and developers expect GPT-5 to introduce.

Bigger context window

Perhaps the most anticipated aspect of the upcoming GPT-5 model is its context window. GPT-4o currently has a context length of 128K tokens; experts and analysts predict that GPT-5 will have a significantly larger one. This would allow the model to process larger chunks of text and compare dense documents, each hundreds of pages long, at the same time.
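To get a feel for what a 128K-token window means in practice, here is a back-of-the-envelope conversion from tokens to book pages. The constants are common rules of thumb for English text (roughly 4 characters per token, roughly 1,800 characters per page), not official OpenAI figures:

```python
# Rough conversion from a token budget to an approximate page count.
# CHARS_PER_TOKEN and CHARS_PER_PAGE are rule-of-thumb assumptions for
# English prose, not official figures from OpenAI.

CHARS_PER_TOKEN = 4      # ~4 characters per token on average
CHARS_PER_PAGE = 1_800   # a typical paperback page of text

def tokens_to_pages(tokens: int) -> int:
    """Convert a token budget into an approximate number of book pages."""
    return tokens * CHARS_PER_TOKEN // CHARS_PER_PAGE

print(tokens_to_pages(128_000))  # GPT-4o's window: 284, i.e. roughly 300 pages
```

Under these assumptions, 128K tokens works out to a little under 300 pages, which matches the "around 300 pages of a book" figure often quoted for GPT-4o.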
In simple terms, the amount of conversation an AI model can read and write at any point is known as its context window, and it is measured in tokens. GPT-4o's 128K tokens translate to around 300 pages of a book. If GPT-5 comes with a much larger context window, it would be able to process far greater volumes of text.

Multimodal capabilities

Reportedly, OpenAI confirmed in June that the upcoming GPT-5 will feature multimodal capabilities, meaning it will process text, images, video, speech, and even code. The new model is also expected to reason with improved accuracy. With these capabilities, GPT-5 is likely to support use cases like content creation, programming, translation, customer service, and more, and to work with audio, video, and text interchangeably, opening up new possibilities for video analysis. Touted as the next big frontier of AI, GPT-5 will not be limited to text and image generation; it could produce AI-generated voices, visuals, and animations as well. This could well change how AI assists users in their day-to-day lives.

Improved reasoning

In AI, reasoning is the process by which models analyse and synthesise information and apply it to make inferences, draw conclusions, and solve complex problems. Reasoning is fundamental to AI models because it allows machines to imitate human thought processes and display intelligent behaviour. With improved reasoning, an AI chatbot could, for example, learn a user's food preferences and budget, and perhaps even assist them with grocery shopping.

In an interview hosted by Dartmouth Engineering (https://www.youtube.com/watch?v=DQacCB9tDaw&t=3904s), OpenAI CTO Mira Murati explained that GPT-3 had the intelligence of a toddler, while GPT-4 was more like a smart high-schooler.
The OpenAI executive said that the next generation of the GPT series will aim for PhD-level intelligence on certain tasks and activities. Incidentally, OpenAI recently formed a new safety and security team that includes Altman, and this team will look at reducing inaccuracies and improving the logical reasoning of the forthcoming model.

Powered by multi-agent systems?

First, an AI agent is a programme that performs tasks on behalf of a user, often with a degree of autonomy and intelligence that mimics human behaviour. Multi-agent systems are sets of specialised AI agents that break a complex task down into small steps, with each agent handling one step, all in service of a common goal. Picture having AI agents to fill out forms, shop online, or order food on your behalf. AI enthusiasts see multi-agent systems as the next big thing in AI.

In an older interview, Altman said that future models would be able to process emails and calendar details, and this could well be how AI agents work. If the reports are true, GPT-5 may ship with AI agents that help users complete tasks involving multiple complex steps. For example, GPT-5 may be able to read a legal document and cross-reference it with various laws and regulations. This could make AI models smarter and more efficient with information.

So, when is GPT-5 arriving?

Reportedly, speaking at the Aspen Ideas Festival, Sam Altman said that OpenAI still has a lot of work to do on the next iteration of ChatGPT, and it seems the company wants to fix the issues seen with the GPT-4 series. He did not, however, divulge a launch date for the model, which is under development. The above is based on information available on the internet.
It should be noted that there is no official information on GPT-5 yet, but going by the progress of the Generative Pre-trained Transformer series over the years, the next generation of GPT is likely to bring significant improvements, especially in areas like logical reasoning and vision capabilities.
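The multi-agent pattern described in the "Powered by multi-agent systems?" section above can be sketched as a simple pipeline: each specialised agent handles one small step and passes its result to the next. Everything below is a toy illustration with made-up agent names; it is not any real framework's API, and certainly not how GPT-5 will actually work:

```python
# A toy sketch of a multi-agent pipeline: specialised "agents" each handle
# one step of a larger task and pass their output along. All agent names
# and behaviours here are illustrative assumptions, not a real system.

from typing import Callable

# An agent is modelled as a function from input text to output text.
Agent = Callable[[str], str]

def research_agent(task: str) -> str:
    """Gathers background material for the task."""
    return f"notes on: {task}"

def drafting_agent(notes: str) -> str:
    """Turns research notes into a draft."""
    return f"draft based on {notes}"

def review_agent(draft: str) -> str:
    """Checks the draft before it reaches the user."""
    return f"reviewed {draft}"

def run_pipeline(task: str, agents: list[Agent]) -> str:
    """Run each specialised agent in turn, feeding its output to the next."""
    result = task
    for agent in agents:
        result = agent(result)
    return result

print(run_pipeline("summarise a legal document",
                   [research_agent, drafting_agent, review_agent]))
# -> reviewed draft based on notes on: summarise a legal document
```

The design point is the decomposition: no single agent understands the whole job, but chaining narrow specialists towards a common goal is exactly what the article means by a multi-agent system.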