Constructor
importObject can also be a LibraryProvider object, a WASI object, or an object containing a wasmLibraryProvider field.
The input module or instance.
The imports to initialize the wasmInstance if it is not provided.
Optional wasmInstance: Instance. Additional wasm instance argument for deferred construction.
Optional env: Environment. Directly specified environment module.
Please use the async version instantiate when targeting browsers.
Apply presence and frequency penalty. This is an in-place operation.
The input logits before penalty.
The appeared token ids.
The number of times each token has appeared since the last PrefillStep: token_freqs[i] is the frequency of token_ids[i] for all i, and every entry of token_freqs should be >= 1.
The presence penalty factor.
The frequency penalty factor.
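The in-place penalty update can be sketched on a plain JS array. In tvmjs the operation runs on a Tensor; the function and parameter names here are illustrative, not the runtime's internals.

```typescript
function applyPenalty(
  logits: number[],
  tokenIds: number[],
  tokenFreqs: number[],
  presencePenalty: number,
  frequencyPenalty: number
): void {
  for (let i = 0; i < tokenIds.length; ++i) {
    // Presence penalty is a flat cost for a token having appeared at all;
    // frequency penalty grows with how often it has appeared.
    logits[tokenIds[i]] -= presencePenalty + tokenFreqs[i] * frequencyPenalty;
  }
}

const logits = [2.0, 1.0, 0.5];
applyPenalty(logits, [0], [3], 0.1, 0.2);
// logits[0] is now 2.0 - (0.1 + 3 * 0.2) = 1.3
```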
Apply softmax with temperature to the logits.
The input logits before softmax with temperature is applied.
The temperature factor.
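A sketch of temperature-scaled softmax on a plain array; the runtime performs this on a Tensor, and this standalone version is for illustration only.

```typescript
function softmaxWithTemperature(logits: number[], temperature: number): number[] {
  const scaled = logits.map((x) => x / temperature);
  const maxVal = Math.max(...scaled); // subtract the max for numerical stability
  const exps = scaled.map((x) => Math.exp(x - maxVal));
  const sum = exps.reduce((a, b) => a + b, 0);
  return exps.map((e) => e / sum);
}

// Lower temperature sharpens the distribution toward the largest logit.
const probs = softmaxWithTemperature([1.0, 2.0, 3.0], 0.5);
```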
Check whether asyncify mode is enabled.
The asyncify mode toggle.
Asynchronously load webgpu pipelines when possible.
The input module.
Attach a detached obj to the auto-release pool of the current scope.
The input obj.
Begin a new scope for tracking object disposal.
Benchmark stable execution of the run function.
The run function.
The device to sync during each run.
The number of times to compute the average.
The number of times to repeat the run.
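An illustrative synchronous version of the benchmark loop. The actual runtime method is async and synchronizes the given device between repeats, which is omitted here; the parameter names are assumptions.

```typescript
function benchmarkSync(
  run: () => void,
  numberRepeat: number,
  repeat: number
): number[] {
  const results: number[] = [];
  for (let r = 0; r < numberRepeat; ++r) {
    const start = Date.now();
    for (let a = 0; a < repeat; ++a) run(); // average each result over `repeat` runs
    results.push((Date.now() - start) / repeat);
  }
  return results;
}

const times = benchmarkSync(() => Math.sqrt(12345), 5, 100);
```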
Bind canvas to the current WebGPU context
The canvas.
Clear canvas
Setup a virtual machine module with given device.
The device (a DLDevice).
The created virtual machine.
Detach the object from the current scope so it won't be released via auto-release during endscope.
The user needs to either explicitly call obj.dispose(), or call attachToCurrentScope to re-attach it to the current scope.
This function can be used to return values to the parent scope.
The object.
Dispose the internal resource. This function can be called multiple times; only the first call will take effect.
Create an empty Tensor with given shape and dtype.
The shape of the array.
The data type of the array.
The device of the Tensor.
The created Tensor.
End a scope and release all created TVM objects under the current scope.
Exception: one can call moveToParentScope to move a value to the parent scope.
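The scope mechanism can be sketched as a stack of auto-release pools. The class and method names below mirror the documented API but are an illustrative stand-in, assuming each tracked object exposes dispose().

```typescript
interface Disposable {
  dispose(): void;
}

// beginScope pushes a pool, endScope disposes everything still attached to
// the top pool, and moveToParentScope re-homes an object one level up.
class ScopeTracker {
  private scopes: Disposable[][] = [];

  beginScope(): void {
    this.scopes.push([]);
  }

  attachToCurrentScope<T extends Disposable>(obj: T): T {
    this.scopes[this.scopes.length - 1].push(obj);
    return obj;
  }

  moveToParentScope<T extends Disposable>(obj: T): T {
    const cur = this.scopes[this.scopes.length - 1];
    cur.splice(cur.indexOf(obj), 1);
    this.scopes[this.scopes.length - 2].push(obj);
    return obj;
  }

  endScope(): void {
    for (const obj of this.scopes.pop() ?? []) obj.dispose();
  }
}

const tracker = new ScopeTracker();
let disposed = false;
tracker.beginScope();
tracker.attachToCurrentScope({ dispose: () => { disposed = true; } });
tracker.endScope(); // the attached object is released here
```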
Given cacheUrl, look up the items to fetch based on cacheUrl/tensor-cache.json.
The cache url.
The device to be fetched to.
The scope identifier of the cache
The type of the cache: "cache" or "indexedDB"
Optional signal: AbortSignal. An optional AbortSignal to abort the fetch.
The metadata.
Get global PackedFunc from the runtime.
The name of the function.
The result function.
Get parameters named in the form prefix_i.
The parameter prefix.
Number of parameters.
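The prefix_i naming convention can be illustrated with a small hypothetical helper that produces the lookup names prefix_0 through prefix_{n-1}; this helper is not part of the tvmjs API.

```typescript
function paramNames(prefix: string, numParams: number): string[] {
  // Parameters are looked up as prefix_0, prefix_1, ..., prefix_{n-1}.
  return Array.from({ length: numParams }, (_, i) => `${prefix}_${i}`);
}

const names = paramNames("param", 3); // ["param_0", "param_1", "param_2"]
```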
Get parameters based on the parameter names provided.
Names of the parameters.
Parameters read.
Initialize webgpu in the runtime.
The given GPU device.
Check if func is PackedFunc.
The input.
The check result.
List all the global function names registered in the runtime.
The name list.
Create a shape tuple to pass to runtime.
The shape.
The created shape tuple.
Move obj's attachment to the parent scope.
This function is useful to make sure objects are still alive when exiting the current scope.
The object to be moved.
The input obj.
Register an async function as an asyncify callable in the global environment.
The name of the function.
function to be registered.
Whether to overwrite the function if it already exists in the registry.
Register an async function as a global function in the server.
The name of the function.
function to be registered.
Whether to overwrite the function if it already exists in the registry.
Register a function as a global function in the TVM runtime.
The name of the function.
Function to be registered.
Whether to overwrite the function if it already exists in the registry.
Register a callback for fetch progress.
The fetch progress callback.
Register an object constructor.
The name of the function.
Function to be registered.
Whether to overwrite the function if it already exists in the registry.
Obtain the runtime information in readable format.
Sample index via top-p sampling.
The input logits before normalization.
The temperature factor; argmax is taken if temperature = 0.0.
The top_p
The sampled index.
Sample index via top-p sampling.
The distribution, i.e. the logits after applySoftmaxWithTemperature() has been applied.
The top_p
The sampled index.
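Top-p (nucleus) sampling over an already-normalized distribution can be sketched as below. The function name and the explicit rand parameter are illustrative assumptions; the runtime draws from its internal generator instead.

```typescript
function sampleTopPFromProb(prob: number[], topP: number, rand: number): number {
  // Sort indices by descending probability and keep the smallest set
  // whose cumulative mass reaches topP (the "nucleus").
  const order = prob.map((_, i) => i).sort((a, b) => prob[b] - prob[a]);
  const nucleus: number[] = [];
  let mass = 0;
  for (const i of order) {
    nucleus.push(i);
    mass += prob[i];
    if (mass >= topP) break;
  }
  // Sample within the nucleus, renormalized to its own mass.
  let u = rand * mass;
  for (const i of nucleus) {
    u -= prob[i];
    if (u <= 0) return i;
  }
  return nucleus[nucleus.length - 1];
}
```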
Set packed function arguments into the location indicated by argsValue and argsCode. Allocate new temporary space from the stack if necessary.
The call stack.
The input arguments.
The offset of packedArgs.
Set the seed of the internal LinearCongruentialGenerator.
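Seeding makes sampling reproducible. Below is an illustrative linear congruential generator; the constants and class shape of tvmjs's internal LinearCongruentialGenerator may differ.

```typescript
class LCG {
  private state: number;

  constructor(seed: number) {
    this.state = seed >>> 0;
  }

  setSeed(seed: number): void {
    this.state = seed >>> 0;
  }

  // Numerical Recipes constants, modulus 2^32.
  next(): number {
    this.state = (Math.imul(this.state, 1664525) + 1013904223) >>> 0;
    return this.state / 2 ** 32; // uniform in [0, 1)
  }
}

// Two generators with the same seed produce the same sequence.
const g1 = new LCG(42);
const g2 = new LCG(42);
```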
Show image in canvas.
The image array on GPU: a height x width uint32 Tensor in RGBA format.
Get the system-wide library module in the wasm. The system library is a global module that contains self-registered functions run at startup.
The system library module.
Clear the tensor cache.
Update the tensor cache.
The name of the array.
The content.
Convert func to PackedFunc
Input function.
The converted function.
Get type index from type key.
The type key.
The corresponding type index.
Perform action under a new scope.
The action function.
The result value.
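The withNewScope pattern can be sketched as follows; begin and end are illustrative stand-ins for the runtime's beginScope/endScope, and the key point is that the scope is always ended, even if the action throws.

```typescript
function withNewScope<T>(begin: () => void, end: () => void, action: () => T): T {
  begin();
  try {
    return action();
  } finally {
    end(); // release everything attached to the scope, even on error
  }
}

const log: string[] = [];
const result = withNewScope(
  () => { log.push("begin"); },
  () => { log.push("end"); },
  () => 42
);
```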
Wrap a function obtained from the TVM runtime as an AsyncPackedFunc through the asyncify mechanism.
You only need to call it if the function may call back into an async JS function via asyncify; a common example is GPU synchronization.
It is always safe to wrap any function with asyncify; however, you do need to make sure you use await when calling the wrapped function.
The PackedFunc.
The wrapped AsyncPackedFunc
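In terms of the calling convention, the wrapper turns every call into a Promise, which is why await is required. A minimal sketch with assumed type aliases; the real asyncify mechanism additionally suspends and resumes the wasm stack, which is omitted here.

```typescript
type PackedFunc = (...args: unknown[]) => unknown;
type AsyncPackedFunc = (...args: unknown[]) => Promise<unknown>;

// Every call through the wrapper yields a Promise, so callers must await it.
function wrapAsPromise(fn: PackedFunc): AsyncPackedFunc {
  return async (...args: unknown[]) => fn(...args);
}

const asyncSquare = wrapAsPromise((x) => (x as number) * (x as number));
// const y = await asyncSquare(3); // y === 9
```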
TVM runtime instance.
All objects (Tensor, Module, PackedFunc) returned by TVM runtime function calls, as well as PackedFunc instances, are tracked through a scope mechanism and will be auto-released when we call endScope.
This is necessary to release the underlying WASM and WebGPU memory, which is not tracked by the native JS garbage collection mechanism.
This does mean that we have to get familiar with the following functions: