A neural network architecture that uses self-attention mechanisms to process all positions of a sequence in parallel, forming the foundation of virtually all modern large language models.
Introduced in the 2017 paper "Attention Is All You Need," the transformer architecture revolutionized natural language processing. Unlike RNNs that process tokens sequentially, transformers use self-attention to weigh the relevance of all tokens simultaneously, enabling massive parallelization. This architecture powers GPT (decoder-only), BERT (encoder-only), and T5 (encoder-decoder) models. Transformers scale effectively with more parameters and data, leading to the current era of large language models.
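The core operation described above, in which every token weighs the relevance of every other token, can be sketched as scaled dot-product self-attention. This is a minimal illustrative implementation in NumPy, not production model code; the projection matrices and dimensions are arbitrary placeholders.

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Scaled dot-product self-attention over a sequence of token vectors.

    X: (seq_len, d_model) token embeddings.
    Wq, Wk, Wv: learned projection matrices (placeholders here).
    """
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    d_k = Q.shape[-1]
    # Every token scores its relevance against every other token at once,
    # which is what allows transformers to parallelize across the sequence.
    scores = Q @ K.T / np.sqrt(d_k)
    # Softmax over each row turns scores into attention weights.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    # Output is a relevance-weighted mixture of value vectors.
    return weights @ V

rng = np.random.default_rng(0)
seq_len, d_model = 4, 8
X = rng.normal(size=(seq_len, d_model))
# Identity projections keep the sketch simple; real models learn these.
Wq = Wk = Wv = np.eye(d_model)
out = self_attention(X, Wq, Wk, Wv)
print(out.shape)  # (4, 8): one contextualized vector per input token
```

Note that the whole sequence is processed in one matrix multiply, in contrast to an RNN, which would need `seq_len` sequential steps.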