
New option called "best" for args.save_strategy. #31817

Open · wants to merge 11 commits into main
Conversation

seanswyi
Contributor

@seanswyi seanswyi commented Jul 6, 2024

What does this PR do?

Addresses #31626.

Adds a new option called "best" for TrainingArguments.save_strategy which saves the model checkpoint each time a new best performance is achieved.
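The intended behavior, stripped of Trainer internals, is simply "checkpoint only when the tracked metric improves." A minimal, library-independent sketch of that rule (the function name and signature here are illustrative, not the actual Trainer API; `greater_is_better` mirrors the `TrainingArguments` flag of the same name):

```python
# Sketch of save_strategy="best": a checkpoint is saved only on a new best metric.
def best_checkpoint_steps(metric_history, greater_is_better=True):
    """Return the evaluation indices at which a checkpoint would be saved."""
    best = None
    saved_at = []
    for step, value in enumerate(metric_history):
        is_new_best = (
            best is None
            or (greater_is_better and value > best)
            or (not greater_is_better and value < best)
        )
        if is_new_best:
            best = value
            saved_at.append(step)  # i.e. should_save is switched on
    return saved_at

# Accuracy improves at evaluations 0, 1, and 3 only.
print(best_checkpoint_steps([0.71, 0.74, 0.73, 0.80]))  # [0, 1, 3]
```

With `greater_is_better=False` the same function models a loss-style metric, which matters for the validation-loss fallback discussed below.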

Details

  1. The previous _save_checkpoint method was in charge of not only saving the model checkpoint but also determining the best metric and best checkpoint. The logic for determining a new best metric was separated out into the _determine_best_metric method.
  2. _determine_best_metric is called after every evaluation inside of _maybe_log_save_evaluate. The return value new_best_metric is used to determine whether or not a new best metric has been achieved, and if the save strategy is "best" then the TrainerControl's should_save flag is switched on.
    • Contrary to what I initially thought, best_metric is not tracked by default; it's only tracked when args.metric_for_best_model is provided. I believe a best metric of some sort should always be tracked, so if no value is provided, the validation loss is used to determine a new best.
  3. A new object called SaveStrategy was created in trainer_utils that adds a new attribute called BEST to the previous IntervalStrategy.
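Point 3 can be sketched as a plain string-valued `Enum`. The `NO`/`STEPS`/`EPOCH` members mirror the existing `IntervalStrategy` values; `BEST` is the addition the PR describes. This is an illustrative reconstruction, not the exact code in `trainer_utils` (note that Python enums with members cannot be subclassed, so the values are duplicated rather than inherited):

```python
from enum import Enum

class IntervalStrategy(str, Enum):
    NO = "no"
    STEPS = "steps"
    EPOCH = "epoch"

class SaveStrategy(str, Enum):
    # Same members as IntervalStrategy (enums with members can't be subclassed) ...
    NO = "no"
    STEPS = "steps"
    EPOCH = "epoch"
    # ... plus the new option: save only on a new best metric.
    BEST = "best"

print(SaveStrategy("best") is SaveStrategy.BEST)  # True
```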

I'm not sure I like the rather "hack-y" way I implemented this: manually switching the TrainerControl's should_save flag instead of delegating to the callback handler, as the other flags are handled. The problem is that the flags are normally updated before _maybe_log_save_evaluate is called inside the inner training loop, so with the current logic there's no way to know at that point whether a new best metric has been achieved. I'm open to other suggestions.
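The ordering issue above can be illustrated with a toy control object: the callback handler has already decided the flags before evaluation runs, so the only place left to turn on should_save for the "best" strategy is after the metric comparison. All names below are illustrative stand-ins for the Trainer internals, not the real implementation:

```python
from dataclasses import dataclass

@dataclass
class TrainerControl:
    # Normally set by the callback handler *before* evaluation runs,
    # which is too early to know whether a new best was achieved.
    should_save: bool = False

def maybe_save_after_evaluate(control, metric, state, greater_is_better=False):
    """Toy _maybe_log_save_evaluate: the best-metric check can only happen
    here, after evaluation, so should_save is flipped manually."""
    new_best = state.get("best_metric") is None or (
        metric > state["best_metric"] if greater_is_better
        else metric < state["best_metric"]
    )
    if new_best:
        state["best_metric"] = metric
        control.should_save = True  # the "hack-y" manual toggle
    return control.should_save

state = {"best_metric": None}
control = TrainerControl()
maybe_save_after_evaluate(control, 0.9, state)  # first eval is always a new best
print(control.should_save)  # True
```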

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

@muellerzr @SunMarc

`save_strategy` previously followed `IntervalStrategy` but now follows
`SaveStrategy`.

Changes were made accordingly to the code and the docstring.
1. Logic to determine the best metric was separated out from
`_save_checkpoint`.
2. In `_maybe_log_save_evaluate`, whether or not a new best metric was
achieved is determined after each evaluation, and if the save strategy
is "best" then the TrainerControl is updated accordingly.
Same as IntervalStrategy, but with a new attribute called BEST.
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
