解决TensorFlow GPU版出现OOM错误

89 阅读 0 评论 59 点赞

我是靠谱客的博主执着鲜花，这篇文章主要介绍解决TensorFlow GPU版出现OOM错误，现在分享给大家，希望可以做个参考。

问题：

在使用mask_rcnn预测自己的数据集时，会出现下面错误：

复制代码

ResourceExhaustedError: OOM when allocating tensor with shape[1,512,1120,1120] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc
	 [[{{node rpn_model/rpn_conv_shared/convolution}} = Conv2D[T=DT_FLOAT, data_format="NCHW", dilations=[1, 1, 1, 1], padding="SAME", strides=[1, 1, 1, 1], use_cudnn_on_gpu=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](fpn_p2/BiasAdd, rpn_conv_shared/kernel/read)]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.

[[{{node roi_align_mask/strided_slice_17/_4277}} = _Recv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_3068_roi_align_mask/strided_slice_17", tensor_type=DT_INT32, _device="/job:localhost/replica:0/task:0/device:CPU:0"]()]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.

1
2
3
4
5
6
ResourceExhaustedError: OOM when allocating tensor with shape[1,512,1120,1120] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc
	 [[{{node rpn_model/rpn_conv_shared/convolution}} = Conv2D[T=DT_FLOAT, data_format="NCHW", dilations=[1, 1, 1, 1], padding="SAME", strides=[1, 1, 1, 1], use_cudnn_on_gpu=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](fpn_p2/BiasAdd, rpn_conv_shared/kernel/read)]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.

	 [[{{node roi_align_mask/strided_slice_17/_4277}} = _Recv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_3068_roi_align_mask/strided_slice_17", tensor_type=DT_INT32, _device="/job:localhost/replica:0/task:0/device:CPU:0"]()]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info.

原因：

一是、因为图片尺寸为3200*4480，图片的尺寸太大。

二是、我使用的是TensorFlow GPU版，而我GPU的显存只有8G，导致显存不够。

解决：

一是、将图片尺寸改小，小到占用的内存比显存。

二是、不使用GPU进行预测，只使用CPU预测，因为一般CPU内存要大于显存的。但装的又是GPU版的TensorFlow，所以需要在预测程序进行更改。程序在前两行加入下面代码：

复制代码

1
2
import os
os.environ["CUDA_VISIBLE_DEVICES"] = ""

引号里填的是GPU的序号，不填的时候代表不使用GPU。

最后

以上就是执着鲜花最近收集整理的关于解决TensorFlow GPU版出现OOM错误的全部内容，更多相关解决TensorFlow内容请搜索靠谱客的其他文章。

本图文内容来源于网友提供，作为学习参考使用，或来自网络收集整理，版权属于原作者所有。

本文分类：机器学习个人笔记
浏览次数：89 次浏览
发布日期：2023-10-23 01:36:16
本文链接：https://www.kaopuke.com/article/k-p-k_13_u_23_o_22_fz_13__7__10_y.html

解决TensorFlow GPU版出现OOM错误

问题：

原因：

解决：

最后

评论列表共有 0 条评论

发表评论取消回复

解决TensorFlow GPU版出现OOM错误

问题：

原因：

解决：

最后

相关文章

评论列表共有 0 条评论

发表评论 取消回复

微信扫一扫：分享

发表评论取消回复