|
tvm
|
Functions | |
| tvm::te::Tensor | dense_cuda (const Target &target, const tvm::te::Tensor &data, const tvm::te::Tensor &weight, const tvm::te::Tensor &bias, const DataType &out_dtype) |
| Implementation of dense for CUDA backend. More... | |
| Schedule | schedule_dense (const Target &target, const Array< Tensor > &outs) |
| Create a CUDA schedule for dense. More... | |
| Schedule | schedule_injective_from_existing (Schedule sch, const Tensor &out) |
| Updates an existing schedule for the given injective ops. More... | |
| Schedule | schedule_injective (const Target &target, const Array< Tensor > &outs) |
| Create a CUDA schedule for the given output tensors. More... | |
| Schedule | schedule_pool (const Target &target, const Array< Tensor > &outs) |
| Create a CUDA schedule for pool. More... | |
| Schedule | schedule_global_pool (const Target &target, const Array< Tensor > &outs) |
| Create a CUDA schedule for global_pool. More... | |
| Schedule | ScheduleReduce (const Target &target, Operation op, Schedule sch, bool is_idx_reduce=false) |
| Schedule a given reduce operation. More... | |
| void | TraverseBeforeReduce (Schedule s, Operation op) |
| Recursively traverse operator inputs, setting injective inputs to be computed inline. More... | |
| void | TraverseAfterReduce (const Target &target, Schedule s, Operation op) |
| Schedule a reduce op, then invoke TraverseBeforeReduce on each of the op's inputs. More... | |
| Schedule | schedule_reduce (const Target &target, Array< Tensor > outs) |
| Create a CUDA schedule for a reduce operation. More... | |
| Schedule | schedule_softmax (const Target &target, const Array< Tensor > &outs) |
| Create a CUDA schedule for the given softmax output tensors. More... | |
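| tvm::te::Tensor tvm::topi::cuda::dense_cuda (const Target &target, const tvm::te::Tensor &data, const tvm::te::Tensor &weight, const tvm::te::Tensor &bias, const DataType &out_dtype) | inline |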
Implementation of dense for CUDA backend.
| target | The target device |
| data | Tensor with shape [batch, in_dim] |
| weight | Tensor with shape [out_dim, in_dim] |
| bias | Tensor with shape [out_dim]. Optional; to omit bias, pass Tensor() |
| out_dtype | Output data type. Used for mixed precision. |
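The shapes listed above imply the usual dense computation, out[b][o] = sum over i of data[b][i] * weight[o][i], plus bias[o] when a bias is given. The following is a minimal CPU reference sketch of that arithmetic in plain C++, not the CUDA implementation itself. The names `Matrix` and `dense_reference` are hypothetical, an empty `bias` vector stands in for passing `Tensor()`, and `out_dtype` (mixed precision) is not modeled.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a 2-D tensor: one inner vector per row.
using Matrix = std::vector<std::vector<float>>;

// Reference dense: data is [batch, in_dim], weight is [out_dim, in_dim],
// bias is [out_dim] or empty (empty models omitting the bias).
// Returns a [batch, out_dim] result.
Matrix dense_reference(const Matrix& data, const Matrix& weight,
                       const std::vector<float>& bias) {
  const std::size_t batch = data.size();
  const std::size_t out_dim = weight.size();
  assert(bias.empty() || bias.size() == out_dim);

  Matrix out(batch, std::vector<float>(out_dim, 0.0f));
  for (std::size_t b = 0; b < batch; ++b) {
    for (std::size_t o = 0; o < out_dim; ++o) {
      // Both operands must agree on in_dim.
      assert(data[b].size() == weight[o].size());
      float acc = 0.0f;
      for (std::size_t i = 0; i < data[b].size(); ++i) {
        acc += data[b][i] * weight[o][i];
      }
      if (!bias.empty()) {
        acc += bias[o];
      }
      out[b][o] = acc;
    }
  }
  return out;
}
```

A reference like this can serve as a check on the output of a scheduled CUDA kernel for small shapes.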
| Schedule tvm::topi::cuda::schedule_dense (const Target &target, const Array< Tensor > &outs) | inline |
Create a CUDA schedule for dense.
| target | The target to generate a schedule for. |
| outs | The output tensors. |
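| Schedule tvm::topi::cuda::schedule_global_pool (const Target &target, const Array< Tensor > &outs) | inline |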
Create a CUDA schedule for global_pool.
| target | The target to generate a schedule for. |
| outs | The output tensors. |
| Schedule tvm::topi::cuda::schedule_injective (const Target &target, const Array< Tensor > &outs) | inline |
Create a CUDA schedule for the given output tensors.
| target | The target to generate a schedule for. |
| outs | The output tensors. |
| Schedule tvm::topi::cuda::schedule_injective_from_existing (Schedule sch, const Tensor &out) | inline |
Updates an existing schedule for the given injective ops.
| sch | The schedule to update. |
| out | The tensor representing the injective op. |
| Schedule tvm::topi::cuda::schedule_pool (const Target &target, const Array< Tensor > &outs) | inline |
Create a CUDA schedule for pool.
| target | The target to generate a schedule for. |
| outs | The output tensors. |
| Schedule tvm::topi::cuda::schedule_reduce (const Target &target, Array< Tensor > outs) |
Create a CUDA schedule for a reduce operation.
| target | The target to generate a schedule for. |
| outs | The output tensors. |
| Schedule tvm::topi::cuda::schedule_softmax (const Target &target, const Array< Tensor > &outs) | inline |
Create a CUDA schedule for the given softmax output tensors.
| target | The target to generate a schedule for. |
| outs | The output tensors. |
| Schedule tvm::topi::cuda::ScheduleReduce (const Target &target, Operation op, Schedule sch, bool is_idx_reduce=false) |
Schedule a given reduce operation.
| target | The target to generate a schedule for. |
| op | The operation representing the reduce operation. |
| sch | The schedule to apply this scheduling to. |
| is_idx_reduce | Pass true to schedule a reduce op that returns an index, such as argmax or argmin. |
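To illustrate why an index-returning reduce is flagged separately, the sketch below contrasts a plain max reduction with an argmax reduction in ordinary C++. It is only a conceptual illustration under assumed names (`max_reduce`, `argmax_reduce`); it does not show how the CUDA schedule handles either case. The point is that an argmax has to track a candidate index alongside the running value, while a value reduce tracks the value alone.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Value reduce: only the running maximum is carried through the loop.
float max_reduce(const std::vector<float>& xs) {
  assert(!xs.empty());
  float best = xs[0];
  for (std::size_t i = 1; i < xs.size(); ++i) {
    if (xs[i] > best) {
      best = xs[i];
    }
  }
  return best;
}

// Index reduce: the running maximum AND its position are carried together.
// Ties resolve to the first occurrence because the comparison is strict.
std::size_t argmax_reduce(const std::vector<float>& xs) {
  assert(!xs.empty());
  std::size_t best_idx = 0;
  float best_val = xs[0];
  for (std::size_t i = 1; i < xs.size(); ++i) {
    if (xs[i] > best_val) {
      best_val = xs[i];
      best_idx = i;
    }
  }
  return best_idx;
}
```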
| void tvm::topi::cuda::TraverseAfterReduce (const Target &target, Schedule s, Operation op) |
Schedule a reduce op, then invoke TraverseBeforeReduce on each of the op's inputs.
| target | The target to generate a schedule for. |
| s | The schedule being built. |
| op | The reduce op |
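The traversal order described for TraverseAfterReduce and TraverseBeforeReduce can be sketched on a toy operator graph. Everything below is hypothetical (`Kind`, `ToyOp`, and both functions are illustrative stand-ins, not TVM types), and the boolean flags only record what a real schedule would do: the reduce op is scheduled first, then each of its inputs is visited, and injective inputs are marked for inline computation recursively.

```cpp
#include <cassert>
#include <utility>
#include <vector>

// Hypothetical operator kinds for the toy graph.
enum class Kind { Placeholder, Injective, Reduce };

// Hypothetical operator node; flags record the scheduling decisions.
struct ToyOp {
  Kind kind;
  std::vector<ToyOp*> inputs;
  bool inlined;
  bool reduce_scheduled;

  explicit ToyOp(Kind k, std::vector<ToyOp*> in = {})
      : kind(k), inputs(std::move(in)), inlined(false), reduce_scheduled(false) {}
};

// Recursively visit inputs, marking injective ops as computed inline.
// Placeholders (graph inputs) end the recursion; other kinds are left
// untouched in this sketch.
void traverse_before_reduce(ToyOp* op) {
  assert(op != nullptr);
  if (op->kind != Kind::Injective) {
    return;
  }
  op->inlined = true;
  for (ToyOp* input : op->inputs) {
    traverse_before_reduce(input);
  }
}

// Schedule the reduce op, then hand each of its inputs to
// traverse_before_reduce.
void traverse_after_reduce(ToyOp* op) {
  assert(op != nullptr);
  if (op->kind != Kind::Reduce) {
    return;
  }
  op->reduce_scheduled = true;
  for (ToyOp* input : op->inputs) {
    traverse_before_reduce(input);
  }
}
```

For a chain placeholder -> injective -> injective -> reduce, calling `traverse_after_reduce` on the reduce node marks it as scheduled and both injective nodes as inlined, while the placeholder is left alone.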