Problem
Custom CUDA written in `cupy` enables 3-10x faster computation compared to native `pytorch`.
For example, using CUDA with the widget, it takes roughly 1 s to load all diffraction patterns from disk to VRAM and visualize them within a Jupyter notebook:

Proposed solution
Support `cupy` as an optional dependency.
For existing functions and files, we can stick to `pytorch` for internal compute and `numpy` for user-facing APIs.
For the `hpc` and `widget` modules, having `cupy` can be beneficial: it saves human time and enables labs with NVIDIA GPUs (a.k.a. the mallard ophus group) to make full use of qunatem and their powerful hardware.
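A common pattern for an optional GPU dependency is a guarded import with a `numpy` fallback, so user-facing code works with or without `cupy` installed. A minimal sketch (the helper name `to_device` and the module layout are hypothetical, not part of the current codebase):

```python
# Guarded import: prefer cupy (GPU) when available, fall back to numpy (CPU).
try:
    import cupy as xp
    HAS_CUPY = True
except ImportError:
    import numpy as xp
    HAS_CUPY = False


def to_device(data):
    """Convert input to an array on the active backend.

    With cupy installed this places the data in VRAM; otherwise it
    returns a plain numpy array, so callers never need to branch.
    """
    return xp.asarray(data)
```

Because both libraries share most of the `numpy` API, internal functions can be written once against `xp` and run on either backend.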