Topics

Transformers use a network architecture that relies onĀ attention mechanismĀ to weigh the influence of different input parts on each output part. The original paper designed the architecture as:

The key components are:

Applications: Machine translation, text summarization, NER, Question answering, text generation, chatbots, computer vision and more.