CacheState manages domain-specific caches for the WebGPU runtime.
Currently contains:
shapeCache: Caches TVM ShapeTuple objects keyed by dimension string.
Why: makeShapeTuple() is called on every tensor operation, crossing
the JS→WASM FFI boundary each time. During LLM decode, the same shapes
repeat every token (e.g. [1,32,128]), so caching avoids thousands of
redundant FFI round-trips.
Invalidation: Never. Shape tuples are immutable value objects that
remain valid for the lifetime of the TVM instance.
Future additions (follow-up PR):
uniformCache: Caches GPU uniform buffers keyed by content hash.
Why: Many dispatches use identical scalar arguments (matrix dims, etc.).
Reusing the buffer avoids createBuffer + writeBuffer overhead.
Invalidation: Must invalidate on any GPU buffer deallocation, since
buffer pointers can be reused by the allocator, making cached entries
that reference the old buffer stale.
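A rough sketch of how the planned uniformCache and its invalidation rule might look. All names here are hypothetical (this is a follow-up PR that does not exist yet), and the GPU buffer creation is stubbed so the caching and invalidation logic can be shown in isolation; a real hash function would replace the simple join-based key.

```typescript
// Stand-in for a GPU uniform buffer handle.
interface UniformBuffer { id: number; }

let nextBufferId = 0;
// Stand-in for the createBuffer + writeBuffer sequence being avoided.
function createUniformBuffer(data: Uint32Array): UniformBuffer {
  return { id: nextBufferId++ };
}

class UniformCache {
  // Keyed by content hash of the scalar arguments.
  private cache = new Map<string, UniformBuffer>();

  get(data: Uint32Array): UniformBuffer {
    // Simple stand-in for a real content hash.
    const key = Array.from(data).join(",");
    let buf = this.cache.get(key);
    if (buf === undefined) {
      buf = createUniformBuffer(data);
      this.cache.set(key, buf);
    }
    return buf;
  }

  // Must run on any GPU buffer deallocation: the allocator may reuse
  // buffer pointers, so cached entries could reference stale memory.
  invalidate(): void {
    this.cache.clear();
  }
}
```

Clearing the whole cache on any deallocation is the conservative choice; a finer-grained scheme would have to track which cached entries reference which allocations.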