'Coding/Image' 카테고리의 글 목록 (2 Page)

YOLO PyTorch Implementation

Architecture: 24 Conv + 2 FC layers (참고로 fast YOLO 같은 경우에는 9 conv + 2 FC layers) 1. model.py 코드: 더보기 """ Implementation of Yolo (v1) architecture with slight modification with added BatchNorm. """ import torch import torch.nn as nn """ Information about architecture config: Tuple is structured by (kernel_size, filters, stride, padding) "M" is simply maxpooling with stride 2x2 and kernel 2x2 List..

Coding/Image 2021. 6. 14. 21:30

ICDAR2019 Tutorials on OCR

한 번 쭉 훑어본 소감 장점 - 역사적인 발전 단계를 알 수 있다 - 이론적인 배경을 알기 좋다. - PyTorch에서 구현된 torchmore이란 라이브러리로 간결한 코딩을 할 수 있음을 알게 된다. 단점 - 현대 딥러닝 적용 전의 기술들을 이해하는 건 머리가 너무 아프면서도 실무적으로 도움이 안 된다. - 여타 다른 튜토리얼과 마찬가지로 여러 기법이나 issue를 알게되지만, 결국 구체적이진 않아서... 응용은 나의 몫이다. → "하.. 그냥 하던 거 하자"라는 생각이 든다. 좋은 리뷰지만 추가적으로 건진 건 없다. [1] Tutorials – ICDAR2019 Tutorials – ICDAR2019 GMPRDIA: Graph-based Methods in Pattern Recognition and ..

Coding/Image 2021. 6. 10. 10:06

heatmap

backbone_layer = model.get_layer("layer_name") from tensorflow.keras.applications.vgg16 import VGG16, preprocess_input, decode_predictions from tensorflow.keras.models import Model from tensorflow.keras.preprocessing import image import numpy as np import cv2 import matplotlib.pyplot as plt import tensorflow.keras.backend as K import tensorflow as tf # img_path = "D:\python\pjt_odo/2주차\Test_imag..

Coding/Image 2021. 6. 9. 18:12

smote

데이터 증강기법 키워드 데이터가 불균형할 때 쓰는 기법 (일반적인 데이터 증폭에도 사용됨) knn: 데이터가 만약 이진분류하는 거다, 하나는 데이터가 수없이 많고 하나는 두 개 밖에 없다고 하자. accuracy로 하면 x로만 찍으면 잘 나오니까 F score로 하는 거고, knn 알고리즘 (k-nearest) 유사한 놈들 생성시키기. 수량을 비슷하게 맞춘 뒤에 다시 acc로 판단하기. imagedatagenerator도 되긴 하지만 smote도 좋다. 대회 이전에 미리 증강법을 통해서 데이터 뿔리고 모델 돌리고 실데이터 나오면 그 때 모델 맞춰가는 것임. 대회 시작 전부터 미리 작업 들어가야 함.

Coding/Image 2021. 6. 9. 12:56

FOTS

한국어 텍스트 탐지 및 인식을 위한 프로젝트 (딥러닝을 활용한 OCR 혹은 STR) 문제 정의: 딥러닝을 활용하여 이미지 형식으로 존재하는 텍스트를 탐지하고 인식하는 프로젝트. 이미지 내부에서 존재하는 텍스트는 다양한 변형(가령 폰트, 크기, 비율, 기울기 등)이 존재하므로 어려운 과제이다. 또한 이는 컴퓨터 비젼과 NLP를 합친 문제이기 때문에 실생활에서 여러 응용이 가능하다. (자율주행, 풍경 이해, 문서 분석 등) 좀 시간 걸리더라도 [2] 참고하여 FOTS ipynb 이해하고 돌려보기. Table of Contents 6개 쥬피터노트북 Recognition (5) data_generator (2) detection_icdar (4) detection_synthtext (3) fots_eda (1)..

Coding/Image 2021. 6. 8. 18:14

Affine Transformation

Reference: [1] Affine transformation[아핀변환] : 네이버 블로그 (naver.com) [2] What are affine transformations? - YouTube Good: very intuitive [3] fots fast oriented text spotting with a unified network - YouTube

Coding/Image 2021. 6. 8. 17:53

ICDAR 2019

Downloads - ICDAR 2019 Robust Reading Challenge on Multi-lingual scene text detection and recognition - Robust Reading Competition (uab.es) Note that this task only requires localization results (as indicated in results format in the tasks page), but the ground truth also provides the script id of each bounding box and the transcription. This extra information will be needed in Tasks 3 and 4. Ex..

Coding/Image 2021. 6. 8. 16:41

[DMQA Open Seminar] Scene Text Detection and Recognition

What is Scene Text Detection and Recognition? 이미지 인식 연구의 응용 분야: 일상적인 풍경 이미지에서 글자가 있는 영역을 탐지하고 컴퓨터 글자로 변환하는 과제 예> 이미지 번역, 차량 번호판, 이정표, 명함 인식, 이미지 검색 등 실생활에서 다양하게 활용되고 있음 OCR vs STR OCR STR 배경 단순함 복잡함 글씨체 규칙적 다양함 글자 배열 수평 각도 및 구도가 다양함 색 단조로움 다채로움 Challenges (1) Arbitrary-oriented/Multi-oriented text, (2) Occlusion (겹쳐져 있는 문자), (3) Curved text format RECT (Rectangle) RBOX (Rotated Box) QUAD (Quadri..

Coding/Image 2021. 6. 8. 12:46

linguana

고정 헤더 영역

메뉴 레이어

메뉴 리스트

검색 레이어

검색 영역

Coding/Image

추가 정보

인기글

최신글

페이징

티스토리툴바