Respect PyTorch precision when loading checkpoints by taivu1998 · Pull Request #940 · Physical-Intelligence/openpi

taivu1998 · 2026-05-10T11:23:03Z

Summary

Fixes #788.

This PR makes PyTorch checkpoint loading respect TrainConfig.pytorch_training_precision instead of silently falling back to the nested model config's default dtype.

Root Cause

scripts/train_pytorch.py already updates the model config dtype before constructing PI0Pytorch, but BaseModelConfig.load_pytorch did not do the same. As a result, a config requesting float32 PyTorch precision could still instantiate and load a PyTorch checkpoint through a bfloat16-configured module. The policy-loading path also unconditionally re-applied "bfloat16" after loading, which would keep float32 policy loading broken even if the loader were fixed.
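
The mismatch can be illustrated with a small, self-contained sketch. The two classes below are hypothetical stand-ins, not openpi's actual definitions; only the field name pytorch_training_precision and the two dtype strings are taken from the description above, and the "bfloat16" default on the nested config is an assumption.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class ModelConfig:
    # Hypothetical stand-in for the nested model config.
    # The "bfloat16" default is an assumption for illustration.
    dtype: str = "bfloat16"


@dataclass(frozen=True)
class TrainConfig:
    # Hypothetical stand-in for TrainConfig.
    model: ModelConfig = field(default_factory=ModelConfig)
    pytorch_training_precision: str = "bfloat16"


def dtype_seen_by_loader_before_fix(train_config: TrainConfig) -> str:
    # Pre-fix behaviour as described: the loader reads the nested
    # model config's dtype and never consults the training precision.
    return train_config.model.dtype


cfg = TrainConfig(pytorch_training_precision="float32")
requested = cfg.pytorch_training_precision
used = dtype_seen_by_loader_before_fix(cfg)
```

Here requested and used disagree, which is the silent fallback that this PR removes.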

Changes

Derive a non-mutating PyTorch model config in load_pytorch with dtype=train_config.pytorch_training_precision before loading safetensors.
Add an explicit early error for model types that are not supported by PI0Pytorch.
Make PyTorch policy loading re-apply the configured precision instead of hard-coding "bfloat16".
Add focused monkeypatched tests for both bfloat16 and float32 loader precision propagation, unsupported model-type rejection, and policy post-load precision forwarding.
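
A minimal sketch of the first two changes, assuming the model config is a frozen dataclass so that dataclasses.replace can produce a copy. The class definitions, the model_type field, the SUPPORTED_MODEL_TYPES set, and the helper name derive_pytorch_model_config are all hypothetical and are not taken from the openpi code.

```python
import dataclasses


@dataclasses.dataclass(frozen=True)
class ModelConfig:
    # Hypothetical stand-in for the nested model config.
    model_type: str = "pi0"
    dtype: str = "bfloat16"


@dataclasses.dataclass(frozen=True)
class TrainConfig:
    # Hypothetical stand-in for TrainConfig.
    model: ModelConfig
    pytorch_training_precision: str = "bfloat16"


# Assumption: the set of model types the PyTorch model can handle.
SUPPORTED_MODEL_TYPES = frozenset({"pi0", "pi05"})


def derive_pytorch_model_config(train_config: TrainConfig) -> ModelConfig:
    """Return a copy of the model config carrying the configured precision."""
    model_config = train_config.model
    # Fail early, before any weights are read, for unsupported model types.
    if model_config.model_type not in SUPPORTED_MODEL_TYPES:
        raise ValueError(
            f"Unsupported model type for PyTorch loading: {model_config.model_type!r}"
        )
    # dataclasses.replace builds a new instance; the original is left untouched.
    return dataclasses.replace(
        model_config, dtype=train_config.pytorch_training_precision
    )


original = ModelConfig()
derived = derive_pytorch_model_config(
    TrainConfig(model=original, pytorch_training_precision="float32")
)
```

Returning a copy rather than assigning to the existing config keeps the caller's TrainConfig unchanged, so other code paths that read the nested config are unaffected.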

Validation

.venv/bin/python -m pytest src/openpi/models/model_test.py -k pytorch
.venv/bin/python -m pytest src/openpi/policies/policy_test.py -k pytorch
uvx ruff check src/openpi/models/model.py src/openpi/policies/policy_config.py src/openpi/models/model_test.py src/openpi/policies/policy_test.py
uvx ruff format --check src/openpi/models/model.py src/openpi/policies/policy_config.py src/openpi/models/model_test.py src/openpi/policies/policy_test.py
git diff --check

Note: on macOS arm64, the normal uv run path attempts to install Linux-only CUDA JAX wheels from the project lockfile. I validated using a temporary environment synced with the CUDA-only packages skipped, then removed the generated environment and caches.

wadeKeith

Clean fix for #788. Respects pytorch_training_precision during checkpoint loading instead of silently falling back to default dtype. Good test coverage. LGTM! Reviewed by Hermes Agent.

Respect PyTorch precision when loading checkpoints

d685be4

taivu1998 marked this pull request as ready for review May 11, 2026 03:37

taivu1998 requested review from Michael-Equi, jimmyt857 and kvablack as code owners May 11, 2026 03:37

jimmyt857 removed their request for review May 11, 2026 04:08

wadeKeith reviewed May 12, 2026

taivu1998 wants to merge 1 commit into Physical-Intelligence:main from taivu1998:tdv/issue-788-pytorch-dtype
