tvm
|
Functions | |
tvm::te::Tensor | dense_cuda (const Target &target, const tvm::te::Tensor &data, const tvm::te::Tensor &weight, const tvm::te::Tensor &bias, const DataType &out_dtype) |
Implementation of dense for CUDA backend. More... | |
Schedule | schedule_dense (const Target &target, const Array< Tensor > &outs) |
Create a CUDA schedule for dense. More... | |
Schedule | schedule_injective_from_existing (Schedule sch, const Tensor &out) |
Updates an existing schedule for the given injective ops. More... | |
Schedule | schedule_injective (const Target &target, const Array< Tensor > &outs) |
Create a CUDA schedule for the given output tensors. More... | |
Schedule | schedule_pool (const Target &target, const Array< Tensor > &outs) |
Create a CUDA schedule for pool. More... | |
Schedule | schedule_global_pool (const Target &target, const Array< Tensor > &outs) |
Create a CUDA schedule for global_pool. More... | |
Schedule | ScheduleReduce (const Target &target, Operation op, Schedule sch, bool is_idx_reduce=false) |
Schedule a given reduce operation. More... | |
void | TraverseBeforeReduce (Schedule s, Operation op) |
Recursively traverse operator inputs, setting injective inputs to be computed inline. More... | |
void | TraverseAfterReduce (const Target &target, Schedule s, Operation op) |
Schedule a reduce op, then invoke TraverseBeforeReduce on each of the op's inputs. More... | |
Schedule | schedule_reduce (const Target &target, Array< Tensor > outs) |
Create a CUDA schedule for a reduce operation. More... | |
Schedule | schedule_softmax (const Target &target, const Array< Tensor > &outs) |
Create a CUDA schedule for the given softmax output tensors. More... | |
|
inline |
Implementation of dense for CUDA backend.
target | The target device |
data | Tensor with shape [batch, in_dim] |
weight | Tensor with shape [out_dim, in_dim] |
bias | Tensor with shape [out_dim]. Optional; to omit bias, pass Tensor() |
out_dtype | Output data type. Used for mixed precision. |
|
inline |
Create a CUDA schedule for dense.
target | The target to generate a schedule for. |
outs | The output tensors. |
|
inline |
Create a CUDA schedule for global_pool.
target | The target to generate a schedule for. |
outs | The output tensors. |
|
inline |
Create a CUDA schedule for the given output tensors.
target | The target to generate a schedule for. |
outs | The output tensors. |
|
inline |
Updates an existing schedule for the given injective ops.
sch | The schedule to update. |
out | The tensor representing the injective op. |
|
inline |
Create a CUDA schedule for pool.
target | The target to generate a schedule for. |
outs | The output tensors. |
Create a CUDA schedule for a reduce operation.
target | The target to generate a schedule for. |
outs | The output tensors. |
|
inline |
Create a CUDA schedule for the given softmax output tensors.
target | The target to generate a schedule for. |
outs | The output tensors. |
Schedule tvm::topi::cuda::ScheduleReduce | ( | const Target & | target, |
Operation | op, | ||
Schedule | sch, | ||
bool | is_idx_reduce = false |
||
) |
Schedule a given reduce operation.
target | The target to generate a schedule for. |
op | The operation representing the injective operation. |
sch | The schedule to apply this scheduling to |
is_idx_reduce | Pass true to schedule a reduce op that returns an index, such as argmax or argmin. |
Schedule a reduce op, then invoke TraverseBeforeReduce on each of the op's inputs.
target | The target to generate a schedule for. |
s | The schedule we are building |
op | The reduce op |