Saving TF model using save_pretrained()

by kostaras - opened Feb 12, 2024

Feb 12, 2024 • edited Feb 12, 2024

The original tf_model.h5 is about 62.5 MB, but when I load it with TFT5ForConditionalGeneration.from_pretrained() and then save it with save_pretrained(), the saved model is about 126 MB. That is because it saves the shared tokens_embed layer (about 31.5 MB) three times instead of just once. Does anyone know how to save the shared layer only once?
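For anyone who wants to check the duplication on their own checkpoint, here is a minimal sketch, not a fix. It groups weights by a hash of their raw bytes, so tensors stored more than once show up in the same group, and it redoes the size arithmetic from the numbers above. The helper names (`find_duplicate_tensors`, `load_h5_tensors`, `expected_size_mb`) are made up for illustration, and reading the file assumes `h5py` is installed and that the weights are stored as floating-point HDF5 datasets.

```python
import hashlib
from collections import defaultdict


def find_duplicate_tensors(tensors):
    """Group weight names by the SHA-256 of their raw bytes.

    `tensors` maps a weight name to its raw bytes. Only groups with more
    than one name are returned, i.e. identical data stored repeatedly.
    """
    groups = defaultdict(list)
    for name, raw in tensors.items():
        groups[hashlib.sha256(raw).hexdigest()].append(name)
    return [sorted(names) for names in groups.values() if len(names) > 1]


def load_h5_tensors(path):
    """Read every floating-point dataset in an HDF5 file as raw bytes.

    Hypothetical helper: assumes h5py is installed and that the weights
    live in float datasets, which may not hold for every checkpoint.
    """
    import h5py  # imported lazily so the rest of the sketch runs without it

    tensors = {}
    with h5py.File(path, "r") as f:
        def collect(name, obj):
            if isinstance(obj, h5py.Dataset) and obj.dtype.kind == "f":
                tensors[name] = obj[()].tobytes()
        f.visititems(collect)
    return tensors


def expected_size_mb(base_mb, shared_mb, copies):
    """Checkpoint size if the shared layer is stored `copies` times."""
    return base_mb + (copies - 1) * shared_mb
```

Calling `find_duplicate_tensors(load_h5_tensors("tf_model.h5"))` on the re-saved file should list the weight names that hold identical data. With the sizes above, `expected_size_mb(62.5, 31.5, 3)` gives 125.5, which matches the roughly 126 MB I see.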

ybelkada

Feb 13, 2024

cc @Rocketknight1
