データ数と画像のスケーリングがDeep Learningの学習に与える影響

/ /

日本語AIでPubMedを検索

PubMedの提供する医学論文データベースを日本語で検索できます。AI(Deep Learning)を活用した機械翻訳エンジンにより、精度高く日本語へ翻訳された論文をご参照いただけます。

PeerJ Comput Sci.2020;6:e312. cs-312. doi: 10.7717/peerj-cs.312.Epub 2020-11-16.

データ数と画像のスケーリングがDeep Learningの学習に与える影響

Effects of data count and image scaling on Deep Learning training.

Daisuke Hirahara
Eichi Takaya
Taro Takahara
Takuya Ueda

PMID: 33816963 PMCID: PMC7924688. DOI: 10.7717/peerj-cs.312.

抄録

背景:

畳み込みニューラルネットワーク（CNN）を用いたDeep Learningは、画像を利用する様々な分野で大きな成果を上げています。ディープラーニングはデータから自動的に特徴を抽出することができますが、CNNは畳み込み処理によって画像の特徴を抽出します。私たちは、補間法を用いて画像サイズを大きくすることで、効果的な特徴抽出ができると考えました。そこで、データ数の増加に伴い補間手法がどのように変化するかを調べるために、学習用の画像データが少ない場合に、反転や回転によるデータ増強と補間による画像増強の効果を調べて比較しました。さらに、補間による画像の増強がCNNの学習に有用かどうかを明らかにした。医用画像における補間法の有用性を検証するために，胸部X線写真の性別分類データセットであるGender01データセットを用いた．補間法による画像拡大と反転・回転によるデータ拡張を比較するために，Bilinear法を用いて2倍と4倍に拡大した結果を検証した．

Background: Deep learning using convolutional neural networks (CNN) has achieved significant results in various fields that use images. Deep learning can automatically extract features from data, and CNN extracts image features by convolution processing. We assumed that increasing the image size using interpolation methods would result in an effective feature extraction. To investigate how interpolation methods change as the number of data increases, we examined and compared the effectiveness of data augmentation by inversion or rotation with image augmentation by interpolation when the image data for training were small. Further, we clarified whether image augmentation by interpolation was useful for CNN training. To examine the usefulness of interpolation methods in medical images, we used a Gender01 data set, which is a sex classification data set, on chest radiographs. For comparison of image enlargement using an interpolation method with data augmentation by inversion and rotation, we examined the results of two- and four-fold enlargement using a Bilinear method.

結果:

補間法で画像を拡大すると，平均して分類精度が向上することがわかった。最も大きく改善したのは学習データ数が100個のときで，元データを用いた学習モデルの平均分類精度は0.563であった。しかし，補間法を用いて画像サイズを4倍にすると，平均分類精度は0.715と大幅に向上した。また，Bilinear法を用いて学習したモデルは，反転や回転によるデータ増量に比べて，100個の学習データで0.095，5万個の学習データで0.015の平均分類精度の向上が見られた．また，胸部X線画像の平均分類精度を比較したところ，補間法では安定して高い平均分類精度が得られた。

Results: The average classification accuracy improved by expanding the image size using the interpolation method. The biggest improvement was noted when the number of training data was 100, and the average classification accuracy of the training model with the original data was 0.563. However, upon increasing the image size by four times using the interpolation method, the average classification accuracy significantly improved to 0.715. Compared with the data augmentation by inversion and rotation, the model trained using the Bilinear method showed an improvement in the average classification accuracy by 0.095 with 100 training data and 0.015 with 50,000 training data. Comparisons of the average classification accuracy of the chest X-ray images showed a stable and high-average classification accuracy using the interpolation method.

結論:

補間法を用いて画像サイズを大きくしてCNNを学習することは有用な方法である。今後は，様々な医用画像を用いた検証を行い，画像サイズが重要な理由をさらに明らかにしていきたい。

Conclusion: Training the CNN by increasing the image size using the interpolation method is a useful method. In the future, we aim to conduct additional verifications using various medical images to further clarify the reason why image size is important.