Log locally with tensorboard when wandb is disabled. #154

dirkmcpherson · 2024-05-08T16:13:42Z

What this does

This PR adds support for locally logging metrics and videos using tensorboard when wandb is disabled.

How it was tested

Ran with a short eval_freq and log_freq to ensure metric and video logging worked across multiple logging periods.
Tested with tensorboard.enable = true / false and wandb.enable = true / false.
Ran pytest (passed).

How to checkout & try? (for the reviewer)

Run with wandb.enable=false and short eval / log frequencies, e.g.:
python lerobot/scripts/train.py policy=diffusion env=pusht env.task=PushT-v0 dataset_repo_id=lerobot/pusht training.log_freq=10 training.eval_freq=100 wandb.enable=false tensorboard.enable=true

Cadene

Thanks, we should probably add an option for tensorboard logging, as it is widely used.
Does anyone know how these kinds of logging/viz tools are used in the community?
Is wandb more popular than tensorboard? What are the alternatives?

Cadene · 2024-05-09T23:07:14Z

lerobot/scripts/train.py

@@ -349,8 +349,7 @@ def evaluate_and_checkpoint_if_needed(step):
                start_seed=cfg.seed,
            )
            log_eval_info(logger, eval_info["aggregated"], step, cfg, offline_dataset, is_offline)
-            if cfg.wandb.enable:
-                logger.log_video(eval_info["video_paths"][0], step, mode="eval")
+            logger.log_video(eval_info["video_paths"][0], step, mode="eval")


Why not add a cfg.tensorboard.enable argument to support both?

Good idea! Will do.
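
The suggestion above could be sketched roughly as follows. This is a minimal illustration, not the PR's actual code: `MultiLogger`, `build_logger`, and `make_recording_backend` are hypothetical names, and the backends here only record calls into a list where a real implementation would call wandb or a tensorboard writer. It shows why the `if cfg.wandb.enable` guard can be dropped at the call site: the logger fans out to whichever backends are enabled and does nothing when none are.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical backend type: a callable receiving (kind, payload, step).
# A real backend would wrap wandb or a tensorboard writer instead.
Backend = Callable[[str, Tuple[str, object], int], None]


def make_recording_backend(name: str, sink: List[tuple]) -> Backend:
    """Return a stand-in backend that appends every call to `sink`."""

    def backend(kind: str, payload: Tuple[str, object], step: int) -> None:
        sink.append((name, kind, payload, step))

    return backend


@dataclass
class MultiLogger:
    """Fans each log call out to every enabled backend."""

    backends: Dict[str, Backend] = field(default_factory=dict)

    def log_dict(self, metrics: Dict[str, float], step: int, mode: str = "train") -> None:
        for backend in self.backends.values():
            for key, value in metrics.items():
                backend("scalar", (f"{mode}/{key}", value), step)

    def log_video(self, video_path: str, step: int, mode: str = "train") -> None:
        # With no backend enabled this loop is empty, so callers do not
        # need to check any enable flag before calling log_video.
        for backend in self.backends.values():
            backend("video", (f"{mode}/video", video_path), step)


def build_logger(wandb_enable: bool, tensorboard_enable: bool, sink: List[tuple]) -> MultiLogger:
    """Assumed wiring: one backend per enabled flag, both allowed at once."""
    backends: Dict[str, Backend] = {}
    if wandb_enable:
        backends["wandb"] = make_recording_backend("wandb", sink)
    if tensorboard_enable:
        backends["tensorboard"] = make_recording_backend("tensorboard", sink)
    return MultiLogger(backends)
```

With this shape, the two flags are independent, so enabling both logs every metric and video to both backends.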

kashyapakshay · 2024-05-14T01:00:44Z

Thanks, we should probably add an option for tensorboard logging, as it is widely used. Does anyone know how these kinds of logging/viz tools are used in the community? Is wandb more popular than tensorboard? What are the alternatives?

Both are quite popular, and I think each could be preferable in different scenarios. A lot of people prefer Tensorboard (including myself) because of how little overhead there is to spin it up and start using it, especially when you just want to visualize loss curves and maybe some input/output samples. Even some teams who don't mind managing some infra use it (my previous ML team had Tensorboard instances hosted on K8s backed by S3 storage for artifacts). Wandb of course is a managed platform with more sophisticated workflows around tracking, hosting, sharing results, etc., but it comes at a cost (although the free plan is pretty generous for small-scale experimentation). In any case, I think it would be great to have Tensorboard support!

dirkmcpherson added 2 commits May 8, 2024 12:05

Log locally with tensorbaord when wandb is deactivated.

d66969e

cleanup

449d44c

dirkmcpherson changed the title ~~Log locally with tensorbaord when wandb is disabled.~~ Log locally with tensorboard when wandb is disabled. May 8, 2024

Cadene requested changes May 9, 2024

aliberts added the ✨ Enhancement New feature or request label May 12, 2024

dirkmcpherson added 2 commits May 13, 2024 10:55

Merge branch 'main' into local_logging_tensorboard

ccf2782

Allow for simultaneous wandb and tensorboard logging.

ba886ed

dirkmcpherson requested a review from Cadene May 17, 2024 22:53

dirkmcpherson commented May 8, 2024 • edited by alexander-soare
