GPT (the abbreviation GPT stands for Generative Pre-Trained Transformer), developed by Google in 2017 and further refined by OpenAI for a wide range of purposes, is a family of AI models based on the Transformer architecture and representing state-of-the-art models for data processing. What the GPT models have in common is that they have been ‘pre-trained’, i.e. they are trained on large amounts of data – such as text, images, videos, etc. – before being fine-tuned for specific tasks.
The best-known GPT model is arguably ‘ChatGPT’, developed by OpenAI, a leading company in the field of AI research, and first unveiled to the public in November 2022. ChatGPT functions like a chatbot and can respond to questions or queries by using machine learning to understand the context of the conversation and generate appropriate answers. To this end, ChatGPT was trained on a vast amount of data from the internet, including books, articles, websites and social media, as well as on the understanding of human language.
For example, the responses generated by the GPT-3.5 version were evaluated by several thousand test subjects in order to improve the accuracy and quality of the responses from a moral and ethical perspective. The next iteration, GPT-4, utilises even more data and greater computing power to answer increasingly complex queries. Thanks to its broader general knowledge and problem-solving abilities, GPT-4 can solve difficult problems with greater accuracy. It can also be more creative and collaborative than its predecessors by working alongside users on creative and technical writing tasks and, for example, learning a user’s writing style.