Some of the most powerful NLP models like BERT and GPT-2 have one thing in common: they all use the transformer architecture.
Share this post
More powerful deep learning with transformers…
Share this post
Some of the most powerful NLP models like BERT and GPT-2 have one thing in common: they all use the transformer architecture.