Coding/Image
한글문장 OCR
linguana
2021. 5. 13. 15:28
메인 논문
parksunwoo/ocr_kor: 딥러닝을 활용한 한글문장 OCR 연구 (github.com) 박선우
parksunwoo/ocr_kor
딥러닝을 활용한 한글문장 OCR 연구. Contribute to parksunwoo/ocr_kor development by creating an account on GitHub.
github.com
HCLT2019_deeplearningOCR.pdf
1.23MB
핵심: ocr_kor/deep-text-recognition-benchmark/modules/
변환 | 추출 | 시퀀스 | 예측모델 |
TPS | VGG | BiLSTM | Attn |
1. 변환 (TPS)
CRNN-with-STN/spatial_transformer.py at master · sbillburg/CRNN-with-STN (github.com)
Spatial Transformer Networks(STN) and its implementation | by Shanlin Sun | Medium
The TPS transformation is configured by a set of fiducial points, whose coordinates are regressed by a convolutional neural network.
2. 추출 (VGG)
3. 시퀀스 (BiLSTM)
4. 예측 (Attn)
pytorch:Spatial Transformer Networks Tutorial : 네이버 블로그 (naver.com)