PatchAlign3D: Local Feature Alignment for Dense 3D Shape Understanding

PatchAlign3D is an encoder-only 3D model that produces language-aligned patch-level features directly from point clouds. It enables zero-shot 3D part segmentation with fast single-pass inference without requiring test-time multi-view rendering.

Sample Usage

You can run inference on a single shape and save per-point predictions using the following command from the official repository:

python patchalign3d/inference/infer.py \
  --ckpt /path/to/stage2_last.pt \
  --input /path/to/shape.npz \
  --labels "seat,back,leg,arm"

Citation

@misc{hadgi2026patchalign3dlocalfeaturealignment,
  title={PatchAlign3D: Local Feature Alignment for Dense 3D Shape understanding},
  author={Souhail Hadgi and Bingchen Gong and Ramana Sundararaman and Emery Pierson and Lei Li and Peter Wonka and Maks Ovsjanikov},
  year={2026},
  eprint={2601.02457},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2601.02457},
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for patchalign3d/patchalign3d-encoder