Frustum check in simd

kzhsw · December 25, 2024, 2:46am

For bounding boxes aligned to 4 floats (16 bytes).
Depends on cglm.

AVX Impl

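#include <smmintrin.h> /* SSE4.1 (implied by AVX): _mm_blend_ps, _mm_blendv_ps */
#include <cglm/cglm.h>

/* Returns false if the box lies fully outside any of the 6 planes. */
CGLM_INLINE
bool
avx_aabb_frustum(vec4 min, vec4 max, vec4 planes[6]) {
  float dp;
  int    i;
  glmm_128 minv, maxv, plane, zero, one, sign, tmp;
  minv = glmm_load(min);
  maxv = glmm_load(max);
  zero = glmm_set1(0.f);
  one  = glmm_set1(1.f);
  /* force w = 1 so the dot product adds the plane's d term */
  minv = _mm_blend_ps(minv, one, 0x8);
  maxv = _mm_blend_ps(maxv, one, 0x8);

  for (i = 0; i < 6; i++) {
    plane = glmm_load(planes[i]);
    /* per lane: take max where the plane component is positive, else min */
    sign = _mm_cmpgt_ps(plane, zero);
    tmp = _mm_blendv_ps(minv, maxv, sign);
    dp = glmm_dot(plane, tmp);

    if (dp < 0)
      return false;
  }

  return true;
}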

Wasm Impl

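#include <wasm_simd128.h>
#include <cglm/cglm.h>

/* Returns false if the box lies fully outside any of the 6 planes. */
bool
aabb_frustum(vec4 min, vec4 max, vec4 planes[6]) {
  float dp;
  int    i;
  glmm_128 minv, maxv, plane, zero, sign, tmp;
  minv = glmm_load(min);
  maxv = glmm_load(max);
  /* force w = 1 so the dot product adds the plane's d term */
  minv = wasm_f32x4_replace_lane(minv, 3, 1.0f);
  maxv = wasm_f32x4_replace_lane(maxv, 3, 1.0f);
  zero = glmm_set1(0.f);

  for (i = 0; i < 6; i++) {
    plane = glmm_load(planes[i]);
    /* per lane: take max where the plane component is positive, else min */
    sign = wasm_f32x4_gt(plane, zero);
    tmp = wasm_v128_bitselect(maxv, minv, sign);
    dp = glmm_dot(plane, tmp);

    if (dp < 0)
      return false;
  }

  return true;
}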

Wasm Impl with deps (loop unrolled):

The same algo in scalar js

three.js/src/math/Frustum.js at 05dbc5d9f24d290a80173b218b7b8535015674df · mrdoob/three.js · GitHub
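
For comparison, here is a minimal scalar C sketch of the same test, without cglm or intrinsics. The function name `aabb_frustum_scalar` and the flat 24-float plane layout are my own choices for illustration, and it assumes plane normals point into the frustum, as in the SIMD versions above.

```c
#include <stdbool.h>

/* Scalar sketch of the test above (hypothetical helper, not from cglm).
 * planes: 6 * 4 floats, (a, b, c, d) per plane, normals assumed to point inward.
 * min/max: padded to 4 floats; the w lane is ignored and treated as 1. */
static bool aabb_frustum_scalar(const float min[4], const float max[4],
                                const float *planes) {
  for (int i = 0; i < 6; i++) {
    const float *p = planes + 4 * i;
    /* pick the corner farthest along the plane normal */
    float x = p[0] > 0.0f ? max[0] : min[0];
    float y = p[1] > 0.0f ? max[1] : min[1];
    float z = p[2] > 0.0f ? max[2] : min[2];
    /* signed distance of that corner (times the normal length) */
    float dp = p[0] * x + p[1] * y + p[2] * z + p[3];
    if (dp < 0.0f)
      return false; /* even the farthest corner is behind this plane */
  }
  return true;
}
```

The three selects per plane are what the blend / bitselect does in a single instruction in the SIMD versions.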

Evgeni_Popov · December 25, 2024, 2:43pm

To make it worthwhile, you’d have to calculate all the bounding boxes at once in a loop, as I think there are hidden costs to using/calling wasm code (?).

Also, I’m not sure we currently have a frustum check bottleneck, but that may depend on your scene.

kzhsw · December 26, 2024, 1:04am

Yes, if wasm is called once per mesh, the algorithmic improvement might not cover the cost of the js-wasm calls, such as context switching and argument conversion. But it would be worth it when done in batches, e.g. one call per 1k or 2k meshes.
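
A rough sketch of what such a batched entry point could look like, in plain C so it stays independent of the SIMD backend. The name `aabb_frustum_batch`, the 8-floats-per-box layout, and the byte-per-box result buffer are assumptions for illustration, not an existing API.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical batched entry point: one js -> wasm call tests `count` boxes.
 * boxes:   count * 8 floats, min.xyzw then max.xyzw per box (w unused)
 * planes:  6 * 4 floats, (a, b, c, d) per plane, normals assumed inward
 * visible: count bytes, 1 if the box is not fully outside any plane, else 0
 * Returns the number of boxes marked visible. */
size_t aabb_frustum_batch(const float *boxes, size_t count,
                          const float *planes, uint8_t *visible) {
  size_t n_visible = 0;
  for (size_t b = 0; b < count; b++) {
    const float *mn = boxes + 8 * b;
    const float *mx = mn + 4;
    uint8_t inside = 1;
    for (int i = 0; i < 6 && inside; i++) {
      const float *p = planes + 4 * i;
      float dp = p[3]; /* d term, i.e. w treated as 1 */
      for (int k = 0; k < 3; k++)
        dp += p[k] * (p[k] > 0.0f ? mx[k] : mn[k]);
      if (dp < 0.0f)
        inside = 0;
    }
    visible[b] = inside;
    n_visible += inside;
  }
  return n_visible;
}
```

On the js side the boxes and the result bytes would live in wasm memory, so a frame costs one call plus typed-array reads instead of one call per mesh.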
Also, since the check only needs min/max in world space instead of all 8 corners, the transformation of bounding boxes can be simplified, as in cglm’s impl:

github.com

recp/cglm/blob/054b2df0048e655b15bbf5621316c1baba20a66b/include/cglm/box.h#L25

/*!
 * @brief apply transform to Axis-Aligned Bounding Box
 *
 * @param[in]  box  bounding box
 * @param[in]  m    transform matrix
 * @param[out] dest transformed bounding box
 */
CGLM_INLINE
void
glm_aabb_transform(vec3 box[2], mat4 m, vec3 dest[2]) {
  vec3 v[2], xa, xb, ya, yb, za, zb;

  glm_vec3_scale(m[0], box[0][0], xa);
  glm_vec3_scale(m[0], box[1][0], xb);

  glm_vec3_scale(m[1], box[0][1], ya);
  glm_vec3_scale(m[1], box[1][1], yb);

  glm_vec3_scale(m[2], box[0][2], za);
  glm_vec3_scale(m[2], box[1][2], zb);

It’s straightforward to convert this impl to simd by replacing every vec3 with a vec4 (and aligning the pointers), so both the transformation and the check can run in simd, which should give a further performance boost.
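
A sketch of that 4-wide version, written as plain C loops over 4 lanes rather than intrinsics so it compiles anywhere; each inner loop corresponds to one multiply / min / max / add on a 128-bit register. The name `aabb_transform4` and the flat column-major `m[16]` layout (same ordering as cglm's `mat4`) are assumptions for illustration, and the matrix is assumed to be affine.

```c
/* Hypothetical 4-lane AABB transform: scale each matrix column by the
 * box's min and max component, then add the per-lane smaller product to
 * the new min and the larger one to the new max, starting from the
 * translation column.
 * m: 16 floats, column-major (m[4 * col + row]), assumed affine.
 * bmin/bmax/out_min/out_max: 4 floats each; input w lanes are not read. */
void aabb_transform4(const float bmin[4], const float bmax[4],
                     const float m[16], float out_min[4], float out_max[4]) {
  /* start from the translation column */
  for (int r = 0; r < 4; r++) {
    out_min[r] = m[12 + r];
    out_max[r] = m[12 + r];
  }
  for (int c = 0; c < 3; c++) {
    for (int r = 0; r < 4; r++) {
      float a = m[4 * c + r] * bmin[c];
      float b = m[4 * c + r] * bmax[c];
      out_min[r] += a < b ? a : b;
      out_max[r] += a < b ? b : a;
    }
  }
}
```

For an affine matrix the w lane of both outputs comes out as 1, which is the value the frustum check wants in lane 3 anyway.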

This can at least make thinInstanceRefreshBoundingInfo faster with a lot of thin instances.
