KakaoBrain Pororo OCR

by linguana 2021. 6. 23. 16:22

Notes from a quick skim through the Pororo OCR model.
Detection model: CRAFT
Recognition model:
 (1) Feature extraction: VGG or ResNet
 (2) Sequence modeling: BiLSTM or None
 (3) Prediction: CTC or Attention (Transformer is still a work in progress, marked # TODO)
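
To make the CTC option in stage (3) concrete, here is a minimal sketch of greedy CTC decoding in plain Python. This is not Pororo's code: the function name `ctc_greedy_decode`, the toy scores, and the choice of index 0 as the blank symbol are all assumptions made for illustration. The idea is that the recognizer emits one score vector per time step, and decoding takes the best class per step, merges consecutive repeats, then drops blanks.

```python
from itertools import groupby
from typing import Sequence

BLANK = 0  # assumption: class index 0 is the CTC blank symbol


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]], charset: str) -> str:
    """Collapse per-frame argmax predictions into a string (hypothetical helper).

    frame_scores[t][k] is the score of class k at time step t.
    Class 0 is the blank; class k >= 1 maps to charset[k - 1].
    """
    # 1) Pick the highest-scoring class at every time step.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__) for scores in frame_scores
    ]
    # 2) Merge consecutive repeats, then 3) drop blanks.
    collapsed = [k for k, _ in groupby(best_path) if k != BLANK]
    return "".join(charset[k - 1] for k in collapsed)


# Toy input: 6 time steps over the classes [blank, "a", "b"].
scores = [
    [0.10, 0.80, 0.10],  # a
    [0.10, 0.70, 0.20],  # a (repeat, merged with the previous frame)
    [0.90, 0.05, 0.05],  # blank
    [0.20, 0.70, 0.10],  # a (kept, because a blank separates it)
    [0.10, 0.20, 0.70],  # b
    [0.80, 0.10, 0.10],  # blank
]
decoded = ctc_greedy_decode(scores, "ab")
print(decoded)  # prints "aab"
```

The blank between the second and third "a" frames is what lets CTC output a doubled character; without it the repeats would collapse into one. An Attention head would instead generate characters one at a time with a decoder, so it needs no collapse step like this.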

Documentation: https://kakaobrain.github.io/pororo/miscs/ocr.html

Recognition model source (GitHub): https://github.com/kakaobrain/pororo/blob/7d05a75e8062b00e6b65364b8ec6c52b6293ab07/pororo/models/brainOCR/model.py#L18
