Member-only story
Intro to Transformer AI Models: The 5 Fundamentals of the Transformer Approach

In this article about Transformer Models we discover the magic of tools like ChatGPT, which power the recent AI hype.
In an easy-to-read way we discover what’s under the hood of the Transformer models. We do that in very lightweight style that does not require a technical background.
— Let’s dive in…
#1 What are Transformers? A Quick Explanation of the Transformer Characteristics
When you hear the Tech and AI folks talk about the latest news around newest AI models, the terms “Transformer” or “Attention Mechanism” can frequently be overheard.
But what do they mean?
“Transformer” is the term for an AI model architecture that was introduced in 2017 by the famous Google research paper “Attention is all you need”. This architecture uses the “Attention Mechanism” to make better predictions. We’ll get to that a bit later.
Even more often than Transformer, you hear the term GPT, as it is also part of the term ChatGPT.
It is a more detailed description of a Transformer and stands for “Generative Pretrained Transformer.