somatosmpl / docs /EXPLAINABILITY.md
zirobtc's picture
Upload folder using huggingface_hub
bd95c9c verified
Field | Response
:-----|:--------
Intended Task/Domain: | 3D Human Body Modeling β€” parametric body shape and pose synthesis for computer vision, animation, robotics, and human simulation.
Model Type: | Analytical / Parametric (Pose) pipeline. Core components are closed-form (no learned neural network layers in the primary forward pass). Optional: shallow two layer MLP for pose-dependent surface correctives.
Intended Users: | Computer vision researchers; graphics and animation engineers; machine learning engineers; robotics researchers and companies.
Output: | 3D mesh vertices `(B, N_h, 3)` in meters and joint positions `(B, 77, 3)` in meters. Deterministic given fixed inputs β€” no sampling or stochastic components.
Describe how the model works: | SOMA processes identity and pose inputs through four sequential analytical stages: (1) **Barycentric Topology Transfer** β€” a pre-computed sparse barycentric correspondence matrix maps the source model's rest-shape mesh to SOMA's canonical 18,095-vertex topology in O(V_h) time via a single sparse matrix-vector product. (2) **RBF Skeleton Fitting** β€” Radial Basis Function (RBF) regression with Kabsch rotation alignment adapts all 77 joint transforms to the new identity's rest shape in one linear solve per identity, recovering anatomically correct joint positions and orientations. (3) **Linear Blend Skinning (LBS)** β€” standard LBS drives the canonical mesh to the target pose given axis-angle or rotation matrix pose parameters; a GPU-accelerated path via NVIDIA Warp processes batches at > 7,000 meshes/second on an A100. All four stages are fully differentiable.
Name the adversely impacted groups this has been tested to deliver comparable outcomes regardless of: | None
Technical Limitations & Mitigation: | (1) **LBS surface artifacts** β€” standard LBS produces known candy-wrapper and volume-loss artifacts at joints under extreme flexion (e.g., elbow > 120Β°, shoulder abduction > 90Β°). Mitigation: optional pose-dependent corrective MLP reduces these artifacts. (2) **Fixed topology** β€” SOMA uses a fixed canonical mesh and 77-joint skeleton; it cannot dynamically adapt to application-specific topologies without re-running offline registration. (3) **Backend engineering overhead** β€” adding a new identity backend requires authoring a `BaseIdentityModel` wrapper and performing a one-time offline SOMA wrap mesh registration.
Verified to have met prescribed NVIDIA quality standards: | Yes.
Performance Metrics: | Not applicable.
Potential Known Risks: | (1) **Out-of-distribution shape inputs** β€” identity coefficients far outside the training shape space distribution may produce physically implausible body meshes (intersecting limbs, extreme proportions). No clamping is applied; downstream applications should validate output geometry. (2) **LBS artifacts under extreme poses** β€” highly non-rigid poses may produce surface artifacts that are visually unrealistic; applications requiring photorealistic rendering under extreme motion should pair SOMA with a pose corrective or neural rendering module. (3) **Misuse for synthetic identity generation** β€” SOMA can generate arbitrary human body shapes; developers should not use it to impersonate specific real individuals without consent.
Licensing: | [Apache 2.0](../../LICENSE)