From the course: TensorFlow: Working with NLP
Fine-tuning BERT
- [Instructor] As part of the pre-training step, when Google trained BERT on the next sentence prediction task, which is a text classification task, a linear layer was added at the end of the BERT model. The only input fed into that linear layer was the embedding of the CLS token. So in order to perform well, the BERT model learned that it needed to capture all the required information in the CLS token. This means that when we want to fine-tune BERT, say on movie reviews, all we need to do is add a linear classifier layer and use the final embedding of the CLS token as its input. In addition to the linear classifier, we often add a dropout layer to reduce overfitting. We then train, or fine-tune, the model on a labeled dataset. Using the movie review example, this means training the linear classifier with the movie review texts and their associated labels, either positive or negative.…
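The head described above can be sketched in Keras. This is a minimal illustration, not the course's exercise code: the BERT encoder itself is replaced by a placeholder input of final hidden states (in practice it would come from a library such as Hugging Face's transformers), and the hidden size of 768 assumes BERT-base.

```python
import tensorflow as tf

# Hypothetical sketch: a dropout + linear classifier head on top of BERT.
hidden_size = 768  # BERT-base embedding width (assumption)

# Placeholder for BERT's final hidden states: (batch, seq_len, hidden_size).
sequence_output = tf.keras.Input(shape=(None, hidden_size), name="bert_output")

# Use only the final embedding of the [CLS] token (position 0 in the sequence).
cls_embedding = sequence_output[:, 0, :]

# Dropout to reduce overfitting, then a linear classifier producing
# logits for the two movie-review labels (positive / negative).
x = tf.keras.layers.Dropout(0.1)(cls_embedding)
logits = tf.keras.layers.Dense(2, name="classifier")(x)

head = tf.keras.Model(inputs=sequence_output, outputs=logits)
```

During fine-tuning, this head would be compiled with a cross-entropy loss and trained on the labeled reviews, with gradients flowing back into the BERT encoder as well.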