Fix cuBLAS arguments for fp16 dot #3660

apaszke · 2017-11-12T15:21:41Z

Result type has to be fp16 for fp16 dot. See docs of cublasDotEx (look for "datatypes combinations currrently supported").

aten/src/THC/THCBlas.cu


 #ifdef CUDA_HALF_TENSOR
-float THCudaBlas_Hdot(THCState *state, int64_t n, half *x, int64_t incx, half *y, int64_t incy)
+half THCudaBlas_Hdot(THCState *state, int64_t n, half *x, int64_t incx, half *y, int64_t incy)


apaszke · 2017-11-29T12:09:31Z

@soumith can you please review again? I added tests for CUDA half <-> CPU float comparison

soumith · 2017-11-29T15:16:41Z

looks good!

The test_cuda.py setup purports to test half tensors, but actually just re-tests FloatTensors because the keys in type_map were str instead of type. Testing HalfTensors is more complicated, requiring changes to precision and requires excluding some unimplemented methods. We should fully test half CUDA tensors. This change just deletes the duplicate tests of FloatTensor.

* Fix cuBLAS arguments for fp16 dot * Enable FloatTensor <-> CUDA HalfTensor checks in test_cuda.py

apaszke mentioned this pull request Nov 12, 2017

Runtime error when mixing FP32 loss functions and FP16 cnn layers #3651

Closed

soumith requested changes Nov 12, 2017

View reviewed changes

Fix cuBLAS arguments for fp16 dot

2bd779c

apaszke force-pushed the half_dot_fix branch from 122f681 to 2e1f834 Compare November 24, 2017 09:58

Enable FloatTensor <-> CUDA HalfTensor checks in test_cuda.py

f17f2bf

apaszke force-pushed the half_dot_fix branch from 2e1f834 to f17f2bf Compare November 29, 2017 10:15

soumith merged commit 6ae0d47 into master Nov 29, 2017

colesbury deleted the half_dot_fix branch December 2, 2017 21:27

colesbury mentioned this pull request Feb 1, 2018

Reverts force_gpu_half changes from #3660 #5000

Merged

soumith added the 0.3.1 label Feb 4, 2018

soumith pushed a commit that referenced this pull request Feb 7, 2018

Fix cuBLAS arguments for fp16 dot (#3660)

d27c3ce

* Fix cuBLAS arguments for fp16 dot * Enable FloatTensor <-> CUDA HalfTensor checks in test_cuda.py

ezyang added the open source label Jun 24, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix cuBLAS arguments for fp16 dot #3660

Fix cuBLAS arguments for fp16 dot #3660

Uh oh!

apaszke commented Nov 12, 2017 •

edited

Loading

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

apaszke commented Nov 29, 2017

Uh oh!

soumith commented Nov 29, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Fix cuBLAS arguments for fp16 dot #3660

Fix cuBLAS arguments for fp16 dot #3660

Uh oh!

Conversation

apaszke commented Nov 12, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

apaszke commented Nov 29, 2017

Uh oh!

soumith commented Nov 29, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

apaszke commented Nov 12, 2017 •

edited

Loading