Train a Sentence Generating Model for Korean Data


Code for training a sentence generation model on Korean data

Published by Haerim Hwang

BERT, sentence generation, natural language processing, Google Colab, Python

  • This script fine-tunes a BERT model to generate sentences, using the NSMC (Naver Sentiment Movie Corpus) as training data. The script runs in Google Colab, which lets us use a GPU instead of a CPU.

  • Click HERE for the Python script in Google Colab.

  • The code was adapted from the following references.
    https://ratsgo.github.io/nlpbook/docs/generation/train/

    https://ratsgo.github.io/nlpbook/docs/generation/inference1/

    https://ratsgo.github.io/nlpbook/docs/generation/inference2/

    https://huggingface.co/blog/how-to-generate
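
The inference references above cover decoding strategies such as greedy search and top-k sampling. As a rough illustration of the difference, here is a minimal sketch in plain Python. It is not the Colab script itself: the vocabulary, the scores, and the helper names (`softmax`, `greedy_pick`, `top_k_sample`) are all made up for this example, and a real run would take its scores from the fine-tuned model.

```python
import math
import random


def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def greedy_pick(logits):
    """Greedy search: always take the index with the highest score."""
    return max(range(len(logits)), key=lambda i: logits[i])


def top_k_sample(logits, k, rng):
    """Top-k sampling: keep the k best candidates, renormalize, then sample."""
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    probs = softmax([logits[i] for i in top])
    return rng.choices(top, weights=probs, k=1)[0]


# Hypothetical next-token candidates and scores (invented, not model output).
vocab = ["영화", "정말", "재미있다", "지루하다"]
logits = [0.5, 1.2, 3.0, 2.1]

rng = random.Random(0)  # fixed seed so the sketch is repeatable
greedy_token = vocab[greedy_pick(logits)]
sampled_token = vocab[top_k_sample(logits, k=2, rng=rng)]

print("greedy:", greedy_token)
print("top-k sample:", sampled_token)
```

Greedy search returns the same token every time for the same scores, while top-k sampling can return any of the k best candidates, which is why sampled sentences tend to be more varied.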