add all files from project

2024-10-01 03:15:35 -04:00 · 2019-02-17 19:53:52 +09:00 · 2019-02-17 19:53:52 +09:00 · 2a2bfeca15
commit 2a2bfeca15
parent 175017eff3
9 changed files with 274 additions and 1 deletions
--- a/README.md
+++ b/README.md
@ -1,2 +1,275 @@
 # awesome-deep-text-detection-recognition
-A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.
+A curated list of awesome deep learning based papers on text detection and recognition.
+
+<p align='center'>
+<img src = '/overall_histogram.png' height="450px">
+<img src = '/overall_pi_chart.png' height="300px">  
+</p>
+
+## Text Detection
+* Papers are sorted by published date.
+* IC is shorts for ICDAR.
+* Score is F1-score for localization task.
+  * (L) stands for score in [leader-board](http://rrc.cvc.uab.es/).
+  * If the reported score in leader-board is somewhat different from the paper, (L) is provided.
+* `*CODE` means official code and `CODE(M)` means that traiend model is provided.  
+
+<!---
+*Conf.* | *Date* | *Title* | *IC11* | *IC13* | *IC15* | *MSRA-TD500* | *IC17 MLT* | *CTW1500* | *Total Text* | *Resources* |
+:---: | :---: |:--- | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
+'14-ECCV	| 14/10/07	| [Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees](	http://www.whuang.org/papers/whuang2014_eccv.pdf) |	0.78 | 
+15-CVPR	| 15/06/01	| [Symmetry-based text line detection in natural scenes](https://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Zhang_Symmetry-Based_Text_Line_2015_CVPR_paper.pdf) | 0.8 | [0.8043](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=2197) | | | | | | [`PRJ`](http://mclab.eic.hust.edu.cn/~xbai/text/symmetry/SymmetryTextLineDetection.html) <br> [`CODE`](https://github.com/stupidZZ/Symmetry_Text_Line_Detection) |  
+'16-TIP	| 15/10/12 | [Text-Attentional Convolutional Neural Networks for Scene Text Detection](https://arxiv.org/pdf/1510.03283.pdf) |	0.82	| [0.8165](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=7187)	| |	0.69 | 
+'15-ICCV	| 15/12/13	| [Text Flow : A Unified Text Detection System in Natural Scene Images](https://pdfs.semanticscholar.org/11a0/8ced22775a217ba78c566528ed44ea98e3e3.pdf)	| 0.8089	| 0.8025 | 
+'16-arXiv	| 16/03/31	| [Accurate Text Localization in Natural Image with Cascaded Convolutional TextNetwork](https://arxiv.org/pdf/1603.09423.pdf)	| 0.84	| 0.86	| |	0.71 |  
+'16-CVPR	| 16/04/14	| [Multi-Oriented Text Detection with Fully Convolutional Networks](https://arxiv.org/pdf/1604.04018.pdf) |	| 0.83	| 0.54 | 0.74 | | | | [`*TORCH(M)`](https://github.com/stupidZZ/FCN_Text)
+'16-CVPR	| 16/04/22	| [Synthetic Data for Text Localisation in Natural Images](http://www.robots.ox.ac.uk/~ankush/textloc.pdf)	| 0.843	| 0.847 <br> (L)[0.8359](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=3820) | | | | | | [`CODE`](https://github.com/ankush-me/SynthText) <br> [`DB`](http://www.robots.ox.ac.uk/~vgg/data/scenetext/)
+'16-arXiv	| 16/06/29	| [Scene Text Detection Via Holistic, Multi-Channel Prediction](https://arxiv.org/pdf/1606.09002.pdf)	| |	0.8433	| 0.6477 | 
+'16-ECCV	| 16/09/12	| [Detecting Text in Natural Image with Connectionist Text Proposal Network](https://arxiv.org/pdf/1609.03605.pdf) |	0.84	| [0.8215](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=13931)	| [0.6085](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=13930) | | | | | [`*CAFFE(M)`](https://github.com/tianzhi0549/CTPN) <br> [`CAFFE`](https://github.com/qingswu/CTPN) <br> [`TF(M)`](https://github.com/eragonruan/text-detection-ctpn) <br> [`DEMO`](http://textdet.com/) <br> [`BLOG(CH)`](http://slade-ruan.me/2017/10/22/text-detection-ctpn/)
+'17-AAAI	| 16/11/21	|[TextBoxes: A fast text detector with a single deep neural network](https://arxiv.org/pdf/1611.06779.pdf) |	0.85	| 0.85 <br> (L)[0.8767](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=21928) | | | | | | [`*CAFFE(M)`](https://github.com/MhLiao/TextBoxes) <br> [`TF`](https://github.com/shinjayne/shinTB) <br> [`BLOG(KR)`](http://jaynewho.com/post/6)
+'18-TM	| 17/03/03	| [Arbitrary-Oriented Scene Text Detection via Rotation Proposals](https://arxiv.org/pdf/1703.01086.pdf)	|	| [0.9125](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=15904)	| [0.8020](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=17393)	| 0.75 | | | | [`*CAFFE`](https://github.com/mjq11302010044/RRPN)
+'17-CVPR	| 17/03/04	| [Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection](https://arxiv.org/pdf/1703.01425.pdf) | | |	[0.7064](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=13007) |
+'17-CVPR	| 17/03/19	| [Detecting Oriented Text in Natural Images by Linking Segments](https://arxiv.org/pdf/1703.06520.pdf)	|	| 0.853	| 0.75	<br> (L)[0.7636](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=29245)| 0.77 | | | | [`*TF(M)`](https://github.com/bgshih/seglink) <br> [`TF(M)`](https://github.com/dengdan/seglink) <br> [`SLIDE`](http://mclab.eic.hust.edu.cn/~xbai/SpotlightPPT/TextDetection-seglink-spotlight-CVPR17.pdf) <br> [`VIDEO`](https://www.youtube.com/watch?v=w0vZWUi-m0c) |
+'17-arXiv	| 17/03/24 |	[Deep Direct Regression for Multi-Oriented Scene Text Detection](https://arxiv.org/pdf/1703.08289.pdf)	| | 0.86	| 0.81	| 0.74 | 
+'17-arXiv	| 17/04/03	| [Cascaded Segmentation-Detection Networks for Word-Level Text Spotting](https://arxiv.org/pdf/1704.00834.pdf)	|	| 0.86	| 0.71 |
+'17-CVPR	| 17/04/11	| [EAST: An Efficient and Accurate Scene Text Detector](https://arxiv.org/pdf/1704.03155.pdf) | | | 0.8072	<br> (L)[0.8038](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=29855) | 0.7608 | | | | [`TF(M)`](https://github.com/argman/EAST) <br> [`TF`](https://github.com/AKSHAYUBHAT/EAST) <br> [`DEMO`](http://east.zxytim.com/) <br> [`VIDEO`](https://www.youtube.com/watch?v=o5asMTdhmvA)
+'17-ICIP	| 17/05/15	| [WordFence: Text Detection in Natural Images with Border Awareness](https://arxiv.org/pdf/1705.05483.pdf) |	0.84 |	0.86 |
+'17-arXiv	| 17/06/30	| [R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection](https://arxiv.org/pdf/1706.09579.pdf) | |	0.8773	| 0.8254 | | | | | [`TF(M)`](https://github.com/DetectionTeamUCAS/R2CNN_Faster-RCNN_Tensorflow) <br> [`CAFFE(M)`](https://github.com/beacandler/R2CNN)
+'17-CVPR	| 17/07/21	| [Multi-scale FCN with Cascaded Instance Aware Segmentation for Arbitrary Oriented Word Spotting In The Wild](http://openaccess.thecvf.com/content_cvpr_2017/papers/He_Multi-Scale_FCN_With_CVPR_2017_paper.pdf)	|	| 0.85	| 0.63 | 
+'17-arXiv	| 17/08/17	| [Deep Scene Text Detection with Connected Component Proposals](https://arxiv.org/pdf/1708.05133.pdf)	| |	0.919 | 
+'17-ICCV	| 17/08/22	| [WordSup: Exploiting Word Annotations for Character based Text Detection](https://arxiv.org/pdf/1708.06720) | |	0.9064 |	0.7816 |
+'17-ICCV	| 17/09/01	| [Single Shot Text Detector with Regional Attention](https://arxiv.org/pdf/1709.00138.pdf)	|	 | 0.8704 |	[0.7691](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=18294) | | | | | [`*CAFFE(M)`](https://github.com/BestSonny/SSTD) <br> [`PYTORCH`](https://github.com/HotaekHan/SSTDNet) <br> [`VIDEO`](https://www.youtube.com/watch?v=oBWVgz685-k) 
+'17-arXiv	| 17/09/11	| [Fused Text Segmentation Networks for Multi-oriented Scene Text Detection](https://arxiv.org/pdf/1709.03272.pdf) | | | [0.8414](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=30447)	| 0.82 | | |	0.813 |
+'17-ICCV	| 17/10/13	| [WeText: Scene Text Detection under Weak Supervision](https://arxiv.org/pdf/1710.04826.pdf)	| |	0.869 <br> (L)[0.8313](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=20665) |
+'17-ICCV	| 17/10/22	| [Self-organized Text Detection with Minimal Post-processing via Border Learning](http://openaccess.thecvf.com/content_ICCV_2017/papers/Wu_Self-Organized_Text_Detection_ICCV_2017_paper.pdf)	| 0.93	| 0.84 | | | | | | [`*KERAS(M)`](https://gitlab.com/rex-yue-wu/ISI-PPT-Text-Detector)
+'17-ICDAR	| 17/11/11	| [Deep Residual Text Detection Network for Scene Text](https://arxiv.org/pdf/1711.04147.pdf)	|	| 0.9117 <br> (L)[0.8925](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=21966) | |	0.77 |
+'18-AAAI	| 17/11/12	| [Feature Enhancement Network: A Refined Scene Text Detector](https://arxiv.org/pdf/1711.04249.pdf)	| 0.892	| [0.9161](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=28362) |
+'17-arXiv	| 17/11/30	| [ArbiText: Arbitrary-Oriented Text Detection in Unconstrained Scene](https://arxiv.org/pdf/1711.11249.pdf)	|	| | 0.759 |	0.75 |
+'18-AAAI	| 18/01/04	| [PixelLink: Detecting Scene Text via Instance Segmentation](https://arxiv.org/pdf/1801.01315.pdf)	|	| 0.881 | [0.8519](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=30576)	| 0.778 | | | | [`*TF(M)`](https://github.com/ZJULearning/pixel_link) [`TF`](https://github.com/BowieHsu/tensorflow_ocr)
+'18-CVPR	| 18/01/05	| [FOTS: Fast Oriented Text Spotting with a Unified Network](https://arxiv.org/pdf/1801.01671.pdf)	| |	[0.925](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=34624)	| [0.8984](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=34625)	|	| 0.7075 | | | [`PYTORCH`](https://github.com/jiangxiluning/FOTS.PyTorch) <br> [`VIDEO`](https://www.youtube.com/watch?v=F7TTYlFr2QM) | 
+'18-TIP	| 18/01/09	| [TextBoxes++: A Single-Shot Oriented Scene Text Detector](https://arxiv.org/pdf/1801.02765.pdf)	| |	0.88	| 0.829 <br> (L)[0.8475](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=29660) | | | | [`*CAFFE(M)`](https://github.com/MhLiao/TextBoxes_plusplus)
+'18-CVPR	| 18/03/09	| [An end-to-end TextSpotter with Explicit Alighment and Attention](https://arxiv.org/pdf/1803.03474.pdf)	| |	0.9	| 0.87 | | | | | [`*CAFFE(M)`](https://github.com/tonghe90/textspotter)
+'18-CVPR	| 18/03/14	| [Rotation-Sensitive Regression for Oriented Scene Text Detection](https://arxiv.org/pdf/1803.05265.pdf)	| |	0.89	| 0.838 |	0.79 |
+'18-arXiv	| 18/04/08	| [Detecting Multi-Oriented Text with Corner-based Region Proposals](https://arxiv.org/pdf/1804.02690.pdf) | |	0.876	| 0.845 | | | | | [`*CAFFE(M)`](https://github.com/xhzdeng/crpn)
+'18-arXiv	| 18/04/24	| [An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches](https://arxiv.org/pdf/1804.09003.pdf)	|	| 0.92	| 0.86	| |	0.7 |
+'18-IJCAI	| 18/05/03	| [IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection](https://arxiv.org/pdf/1805.01167.pdf)	| | | [0.9047](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=34989) |	0.83 |
+'18-arXiv	| 18/06/07	| [Shape Robust Text Detection with Progressive Scale Expansion Network](https://arxiv.org/pdf/1806.02559.pdf) | | |		[0.8721](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=37493)	| |	0.7245	| 0.8117 | | [`PRJ`](https://github.com/whai362/PSENet)
+'18-ECCV	| 18/07/04	| [TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes](https://arxiv.org/pdf/1807.01544.pdf) | | |	0.826	| 0.783 | | 0.756 | 0.646 |
+'18-ECCV	| 18/07/06	| [Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes](https://arxiv.org/pdf/1807.02242.pdf) | |	0.917	| 0.86 | | | | 0.613 |
+'18-ECCV	| 18/07/10	| [Accurate Scene Text Detection through Border Semantics Awareness and Bootstrapping](https://arxiv.org/pdf/1807.03547.pdf) |	| 0.892	| |	0.801	| 0.69 |
+'19-AAAI    | 18/11/21  | [Scene Text Detection with Supervised Pyramid Context Network](https://arxiv.org/pdf/1811.08605.pdf)  | 0.921 | 0.872 |
+-->
+
+*Conf.* | *Date* | *Title* | *IC13* | *IC15* | *Resources* |
+:---: | :---: |:--- | :---: | :---: | :---: |
+'14-ECCV	| 14/10/07	| [Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees](	http://www.whuang.org/papers/whuang2014_eccv.pdf) |
+15-CVPR	| 15/06/01	| [Symmetry-based text line detection in natural scenes](https://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Zhang_Symmetry-Based_Text_Line_2015_CVPR_paper.pdf) | [0.8043](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=2197) | | [`PRJ`](http://mclab.eic.hust.edu.cn/~xbai/text/symmetry/SymmetryTextLineDetection.html) <br> [`CODE`](https://github.com/stupidZZ/Symmetry_Text_Line_Detection) |  
+'16-TIP	| 15/10/12 | [Text-Attentional Convolutional Neural Networks for Scene Text Detection](https://arxiv.org/pdf/1510.03283.pdf) |	[0.8165](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=7187)	| 
+'15-ICCV	| 15/12/13	| [Text Flow : A Unified Text Detection System in Natural Scene Images](https://pdfs.semanticscholar.org/11a0/8ced22775a217ba78c566528ed44ea98e3e3.pdf)	|0.8025 | 
+'16-arXiv	| 16/03/31	| [Accurate Text Localization in Natural Image with Cascaded Convolutional TextNetwork](https://arxiv.org/pdf/1603.09423.pdf)	| 0.86	| |
+'16-CVPR	| 16/04/14	| [Multi-Oriented Text Detection with Fully Convolutional Networks](https://arxiv.org/pdf/1604.04018.pdf) | 0.83 | 0.54 | [`*TORCH(M)`](https://github.com/stupidZZ/FCN_Text)
+'16-CVPR	| 16/04/22	| [Synthetic Data for Text Localisation in Natural Images](http://www.robots.ox.ac.uk/~ankush/textloc.pdf)	| 0.847 <br> (L)[0.8359](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=3820) | | [`CODE`](https://github.com/ankush-me/SynthText) <br> [`DB`](http://www.robots.ox.ac.uk/~vgg/data/scenetext/)
+'16-arXiv	| 16/06/29	| [Scene Text Detection Via Holistic, Multi-Channel Prediction](https://arxiv.org/pdf/1606.09002.pdf)	|0.8433	| 0.6477 | 
+'16-ECCV	| 16/09/12	| [Detecting Text in Natural Image with Connectionist Text Proposal Network](https://arxiv.org/pdf/1609.03605.pdf) |	[0.8215](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=13931)	| [0.6085](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=13930) | [`*CAFFE(M)`](https://github.com/tianzhi0549/CTPN) <br> [`CAFFE`](https://github.com/qingswu/CTPN) <br> [`TF(M)`](https://github.com/eragonruan/text-detection-ctpn) <br> [`DEMO`](http://textdet.com/) <br> [`BLOG(CH)`](http://slade-ruan.me/2017/10/22/text-detection-ctpn/)
+'17-AAAI	| 16/11/21	|[TextBoxes: A fast text detector with a single deep neural network](https://arxiv.org/pdf/1611.06779.pdf) |	0.85 <br> (L)[0.8767](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=21928) | | [`*CAFFE(M)`](https://github.com/MhLiao/TextBoxes) <br> [`TF`](https://github.com/shinjayne/shinTB) <br> [`BLOG(KR)`](http://jaynewho.com/post/6)
+'18-TM	| 17/03/03	| [Arbitrary-Oriented Scene Text Detection via Rotation Proposals](https://arxiv.org/pdf/1703.01086.pdf)	|	[0.9125](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=15904)	| [0.8020](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=17393)	| [`*CAFFE`](https://github.com/mjq11302010044/RRPN)
+'17-CVPR	| 17/03/04	| [Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection](https://arxiv.org/pdf/1703.01425.pdf) | | [0.7064](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=13007)
+'17-CVPR	| 17/03/19	| [Detecting Oriented Text in Natural Images by Linking Segments](https://arxiv.org/pdf/1703.06520.pdf)	|	0.853	| 0.75	<br> (L)[0.7636](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=29245)| [`*TF(M)`](https://github.com/bgshih/seglink) <br> [`TF(M)`](https://github.com/dengdan/seglink) <br> [`SLIDE`](http://mclab.eic.hust.edu.cn/~xbai/SpotlightPPT/TextDetection-seglink-spotlight-CVPR17.pdf) <br> [`VIDEO`](https://www.youtube.com/watch?v=w0vZWUi-m0c) |
+'17-arXiv	| 17/03/24 |	[Deep Direct Regression for Multi-Oriented Scene Text Detection](https://arxiv.org/pdf/1703.08289.pdf)	| 0.86 |  0.81	| 
+'17-arXiv	| 17/04/03	| [Cascaded Segmentation-Detection Networks for Word-Level Text Spotting](https://arxiv.org/pdf/1704.00834.pdf)	|	0.86	| 0.71 |
+'17-CVPR	| 17/04/11	| [EAST: An Efficient and Accurate Scene Text Detector](https://arxiv.org/pdf/1704.03155.pdf) | | 0.8072	<br> (L)[0.8038](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=29855) | [`TF(M)`](https://github.com/argman/EAST) <br> [`TF`](https://github.com/AKSHAYUBHAT/EAST) <br> [`DEMO`](http://east.zxytim.com/) <br> [`VIDEO`](https://www.youtube.com/watch?v=o5asMTdhmvA)
+'17-ICIP	| 17/05/15	| [WordFence: Text Detection in Natural Images with Border Awareness](https://arxiv.org/pdf/1705.05483.pdf) |	0.86 |
+'17-arXiv	| 17/06/30	| [R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection](https://arxiv.org/pdf/1706.09579.pdf) | 0.8773	| 0.8254 | [`TF(M)`](https://github.com/DetectionTeamUCAS/R2CNN_Faster-RCNN_Tensorflow) <br> [`CAFFE(M)`](https://github.com/beacandler/R2CNN)
+'17-CVPR	| 17/07/21	| [Multi-scale FCN with Cascaded Instance Aware Segmentation for Arbitrary Oriented Word Spotting In The Wild](http://openaccess.thecvf.com/content_cvpr_2017/papers/He_Multi-Scale_FCN_With_CVPR_2017_paper.pdf)	|	0.85	| 0.63 | 
+'17-arXiv	| 17/08/17	| [Deep Scene Text Detection with Connected Component Proposals](https://arxiv.org/pdf/1708.05133.pdf)	| 0.919 | 
+'17-ICCV	| 17/08/22	| [WordSup: Exploiting Word Annotations for Character based Text Detection](https://arxiv.org/pdf/1708.06720) |	0.9064 |	0.7816 |
+'17-ICCV	| 17/09/01	| [Single Shot Text Detector with Regional Attention](https://arxiv.org/pdf/1709.00138.pdf)	|	0.8704 |	[0.7691](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=18294) | [`*CAFFE(M)`](https://github.com/BestSonny/SSTD) <br> [`PYTORCH`](https://github.com/HotaekHan/SSTDNet) <br> [`VIDEO`](https://www.youtube.com/watch?v=oBWVgz685-k) 
+'17-arXiv	| 17/09/11	| [Fused Text Segmentation Networks for Multi-oriented Scene Text Detection](https://arxiv.org/pdf/1709.03272.pdf) | | [0.8414](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=30447)	|
+'17-ICCV	| 17/10/13	| [WeText: Scene Text Detection under Weak Supervision](https://arxiv.org/pdf/1710.04826.pdf)	| 0.869 <br> (L)[0.8313](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=20665) |
+'17-ICCV	| 17/10/22	| [Self-organized Text Detection with Minimal Post-processing via Border Learning](http://openaccess.thecvf.com/content_ICCV_2017/papers/Wu_Self-Organized_Text_Detection_ICCV_2017_paper.pdf)	| 0.84 | | [`*KERAS(M)`](https://gitlab.com/rex-yue-wu/ISI-PPT-Text-Detector)
+'17-ICDAR	| 17/11/11	| [Deep Residual Text Detection Network for Scene Text](https://arxiv.org/pdf/1711.04147.pdf)	|	0.9117 <br> (L)[0.8925](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=21966) | 
+'18-AAAI	| 17/11/12	| [Feature Enhancement Network: A Refined Scene Text Detector](https://arxiv.org/pdf/1711.04249.pdf)	| [0.9161](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=28362) |
+'17-arXiv	| 17/11/30	| [ArbiText: Arbitrary-Oriented Text Detection in Unconstrained Scene](https://arxiv.org/pdf/1711.11249.pdf)	|	| 0.759 |	0.75 |
+'18-AAAI	| 18/01/04	| [PixelLink: Detecting Scene Text via Instance Segmentation](https://arxiv.org/pdf/1801.01315.pdf)	|	0.881 | [0.8519](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=30576)	| [`*TF(M)`](https://github.com/ZJULearning/pixel_link) [`TF`](https://github.com/BowieHsu/tensorflow_ocr)
+'18-CVPR	| 18/01/05	| [FOTS: Fast Oriented Text Spotting with a Unified Network](https://arxiv.org/pdf/1801.01671.pdf)	| [0.925](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=1&m=34624)	| [0.8984](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=34625)	|	[`PYTORCH`](https://github.com/jiangxiluning/FOTS.PyTorch) <br> [`VIDEO`](https://www.youtube.com/watch?v=F7TTYlFr2QM) | 
+'18-TIP	| 18/01/09	| [TextBoxes++: A Single-Shot Oriented Scene Text Detector](https://arxiv.org/pdf/1801.02765.pdf)	| 0.88	| 0.829 <br> (L)[0.8475](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=29660) | [`*CAFFE(M)`](https://github.com/MhLiao/TextBoxes_plusplus)
+'18-CVPR	| 18/03/09	| [An end-to-end TextSpotter with Explicit Alighment and Attention](https://arxiv.org/pdf/1803.03474.pdf)	| 0.9	| 0.87 |[`*CAFFE(M)`](https://github.com/tonghe90/textspotter)
+'18-CVPR	| 18/03/14	| [Rotation-Sensitive Regression for Oriented Scene Text Detection](https://arxiv.org/pdf/1803.05265.pdf)	| 0.89	| 0.838 |	
+'18-arXiv	| 18/04/08	| [Detecting Multi-Oriented Text with Corner-based Region Proposals](https://arxiv.org/pdf/1804.02690.pdf) | 0.876	| 0.845 | [`*CAFFE(M)`](https://github.com/xhzdeng/crpn)
+'18-arXiv	| 18/04/24	| [An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches](https://arxiv.org/pdf/1804.09003.pdf)	| 0.92	| 0.86	|
+'18-IJCAI	| 18/05/03	| [IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection](https://arxiv.org/pdf/1805.01167.pdf)	| | [0.9047](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=34989) |	
+'18-arXiv	| 18/06/07	| [Shape Robust Text Detection with Progressive Scale Expansion Network](https://arxiv.org/pdf/1806.02559.pdf) | |		[0.8721](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=1&m=37493)	| [`PRJ`](https://github.com/whai362/PSENet)
+'18-ECCV	| 18/07/04	| [TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes](https://arxiv.org/pdf/1807.01544.pdf) | |	0.826	|
+'18-ECCV	| 18/07/06	| [Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes](https://arxiv.org/pdf/1807.02242.pdf) | 0.917	| 0.86 |
+'18-ECCV	| 18/07/10	| [Accurate Scene Text Detection through Border Semantics Awareness and Bootstrapping](https://arxiv.org/pdf/1807.03547.pdf) |	0.892	|
+'19-AAAI    | 18/11/21  | [Scene Text Detection with Supervised Pyramid Context Network](https://arxiv.org/pdf/1811.08605.pdf)  | 0.921 | 0.872 | 
+
+<p align='center'>
+<img src = 'https://oss.navercorp.com/CLOVA-AI-OCR/RSC.Papers/blob/master/detection_ic13_results.png' height = '550px'>
+<img src = 'https://oss.navercorp.com/CLOVA-AI-OCR/RSC.Papers/blob/master/detection_ic15_results.png' height = '550px'>
+</p>
+
+## Text Recognition
+* Papers are sorted by published date.
+* IC is shorts for ICDAR.
+* Score is word-accuracy for recognition task.
+  * For results on IC03, IC13, and IC15 dataset, papers used different numbers of samples per paper,  
+  but we did not distinguish between them
+* `*CODE` means official code and `CODE(M)` means that trained model is provided.  
+
+ <!---
+*Conf.* | *Date* | *Title* | *SVT* | *SVT-P* | *IIIT5k* | *CUTE80* | *IC03* | *IC13* | *IC15* | *Resources* |
+:---: | :---: |:--- | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
+'15-ICLR	| 14/12/18	| [Deep structured output learning for unconstrained text recognition](https://arxiv.org/pdf/1412.5903.pdf) |	0.717 | | | | 0.896	| 0.818 | | [`TF`](https://github.com/AlexandreSev/Structured_Data) <br> [`SLIDE`](https://www.robots.ox.ac.uk/~vgg/publications/2015/Jaderberg15a/presentation.pdf) <br> [`VIDEO`](https://www.youtube.com/watch?v=NYkG38RCoRg)
+'16-IJCV	| 15/05/07	| [Reading text in the wild with convolutional neural networks](https://arxiv.org/pdf/1412.1842.pdf)	| 0.807	|	| | |	0.933	| 0.908 | | [`KERAS`](https://github.com/mathDR/reading-text-in-the-wild)
+'16-AAAI	| 15/06/14	| [Reading Scene Text in Deep Convolutional Sequences](https://arxiv.org/pdf/1506.04395.pdf)
+'17-TPAMI	| 15/07/21	| [An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition](https://arxiv.org/pdf/1507.05717.pdf)	| 0.808	| 0.668	| 0.782 | |	0.894	| 0.867 | | [`TORCH(M)`](https://github.com/bgshih/crnn)  <br> [`TF`](https://github.com/weinman/cnn_lstm_ctc_ocr)  <br> [`TF`](https://github.com/watsonyanghx/CNN_LSTM_CTC_Tensorflow) <br> [`TF`](https://github.com/MaybeShewill-CV/CRNN_Tensorflow) <br> [`PYTORCH`](https://github.com/meijieru/crnn.pytorch)  <br> [`BLOG(KR)`](https://medium.com/@mldevhong/%EB%85%BC%EB%AC%B8-%EB%B2%88%EC%97%AD-rcnn-an-end-to-end-trainable-neural-network-for-image-based-sequence-recognition-and-its-f6456886d6f8)
+'16-CVPR	| 16/03/09 |	[Recursive Recurrent Nets with Attention Modeling for OCR in the Wild](https://arxiv.org/pdf/1603.03101.pdf) |	0.807 | |	0.784	| |	0.887	| 0.9 |
+'16-CVPR	| 16/03/12	| [Robust scene text recognition with automatic rectification](https://arxiv.org/pdf/1603.03915.pdf) |	0.819	| 0.718 |	0.819 |	0.592	| 0.901	| 0.886 | | [`PYTORCH`](https://github.com/marvis/ocr_attention) <br> [`PYTORCH`](https://github.com/WarBean/tps_stn_pytorch) 
+'16-CVPR	| 16/06/27	| [CNN-N-Gram for Handwriting Word Recognition](https://www.cs.tau.ac.il/~wolf/papers/CNNNGram.pdf) |	0.8362 | | | | | | | [`VIDEO`](https://www.youtube.com/watch?v=czc2Ipm3Bis)
+'16-BMVC	| 16/09/19	| [STAR-Net: A SpaTial Attention Residue Network for Scene Text Recognition](http://www.visionlab.cs.hku.hk/publications/wliu_bmvc16.pdf) |	0.836	| 0.735	| 0.833	| |	0.899	| 0.891 |
+'17-arXiv	| 17/07/27	| [STN-OCR: A single Neural Network for Text Detection and Text Recognition](https://arxiv.org/pdf/1707.08831.pdf) | 0.798	| |	0.86	| | |	0.903 | | [`*MXNET(M)`](https://github.com/Bartzi/stn-ocr) <br> [`PRJ`](https://bartzi.de/research/stn-ocr) <br> [`BLOG`](https://medium.com/@Synced/stn-ocr-a-single-neural-network-for-text-detection-and-text-recognition-220debe6ded4)
+'17-IJCAI	| 17/08/19	| [Learning to Read Irregular Text with Attention Mechanisms](https://faculty.ist.psu.edu/zzhou/paper/IJCAI17-IrregularText.pdf) | |	0.758	| |	0.693 |
+'17-arXiv	| 17/09/06	| [Scene Text Recognition with Sliding Convolutional Character Models](https://arxiv.org/pdf/1709.01727.pdf) |	0.765	| |	0.816	| |	0.845 |	0.852 |
+'17-ICCV	| 17/09/07	| [Focusing Attention: Towards Accurate Text Recognition in Natural Images](https://arxiv.org/pdf/1709.02054.pdf) |	0.859 | |	0.874 | |	0.942 |	0.933 |	0.706 |
+'18-CVPR	| 17/11/12	| [AON: Towards Arbitrarily-Oriented Text Recognition](https://arxiv.org/pdf/1711.04226.pdf) |	0.828	| 0.73	| 0.87	| 0.768 |	0.915	| |	0.682 |
+'17-NIPS	| 17/12/04	| [Gated Recurrent Convolution Neural Network for OCR](https://papers.nips.cc/paper/6637-gated-recurrent-convolution-neural-network-for-ocr.pdf)	| 0.815	| |	0.808 | |	0.978 | | | [`*TORCH(M)`](https://github.com/Jianfeng1991/GRCNN-for-OCR)
+'18-AAAI	| 18/01/04	| [Char-Net: A Character-Aware Neural Network for Distorted Scene Text Recognition](http://www.visionlab.cs.hku.hk/publications/wliu_aaai18.pdf) |	0.844	| 0.735	| 0.836	| |	0.915 |	0.908 |
+'18-AAAI	| 18/01/04	| [SqueezedText: A Real-time Scene Text Recognition by Binary Convolutional Encoder-decoder Network](https://pdfs.semanticscholar.org/0e59/f7d7e9c9380b425a94038c7a2500b2f6063a.pdf) | | |	0.87	| |	0.931 |	0.929 |
+'18-CVPR	| 18/05/09  | [Edit Probability for Scene Text Recognition](https://arxiv.org/pdf/1805.03384.pdf) | 0.875 | 0.883 | 0.946 | 0.944 |
+'18-TPAMI	| 18/06/25	| [ASTER: An Attentional Scene Text Recognizer with Flexible Rectification](http://122.205.5.5:8071/UpLoadFiles/Papers/ASTER_PAMI18.pdf) |	0.936	| 0.934 |	0.945	| 0.918	| [`*TF(M)`](https://github.com/bgshih/aster)
+'18-ECCV	| 18/09/08	| [Synthetically Supervised Feature Learning for Scene Text Recognition](http://openaccess.thecvf.com/content_ECCV_2018/papers/Yang_Liu_Synthetically_Supervised_Feature_ECCV_2018_paper.pdf) |	0.871	| 0.894	| 	0.947	| 0.94 |
+'19-AAAI    | 18/09/18  | [Scene Text Recognition from Two-Dimensional Perspective](https://arxiv.org/pdf/1809.06508.pdf)   | 0.821 | 0.92 | | 0.914 |
+-->
+
+*Conf.* | *Date* | *Title* | *SVT* | *IIIT5k* | *IC03* | *IC13* | *Resources* |
+:---: | :---: |:--- | :---: | :---: | :---: | :---: | :---: |
+'15-ICLR	| 14/12/18	| [Deep structured output learning for unconstrained text recognition](https://arxiv.org/pdf/1412.5903.pdf) |	0.717 | |0.896	| 0.818 | [`TF`](https://github.com/AlexandreSev/Structured_Data) <br> [`SLIDE`](https://www.robots.ox.ac.uk/~vgg/publications/2015/Jaderberg15a/presentation.pdf) <br> [`VIDEO`](https://www.youtube.com/watch?v=NYkG38RCoRg)
+'16-IJCV	| 15/05/07	| [Reading text in the wild with convolutional neural networks](https://arxiv.org/pdf/1412.1842.pdf)	| 0.807	|	| 	0.933	| 0.908 | [`KERAS`](https://github.com/mathDR/reading-text-in-the-wild)
+'16-AAAI	| 15/06/14	| [Reading Scene Text in Deep Convolutional Sequences](https://arxiv.org/pdf/1506.04395.pdf)
+'17-TPAMI	| 15/07/21	| [An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition](https://arxiv.org/pdf/1507.05717.pdf)	| 0.808	| 0.782 | 0.894	| 0.867 | [`TORCH(M)`](https://github.com/bgshih/crnn)  <br> [`TF`](https://github.com/weinman/cnn_lstm_ctc_ocr)  <br> [`TF`](https://github.com/watsonyanghx/CNN_LSTM_CTC_Tensorflow) <br> [`TF`](https://github.com/MaybeShewill-CV/CRNN_Tensorflow) <br> [`PYTORCH`](https://github.com/meijieru/crnn.pytorch)  <br> [`BLOG(KR)`](https://medium.com/@mldevhong/%EB%85%BC%EB%AC%B8-%EB%B2%88%EC%97%AD-rcnn-an-end-to-end-trainable-neural-network-for-image-based-sequence-recognition-and-its-f6456886d6f8)
+'16-CVPR	| 16/03/09  | [Recursive Recurrent Nets with Attention Modeling for OCR in the Wild](https://arxiv.org/pdf/1603.03101.pdf) |	0.807 | 0.784	| 0.887	| 0.9 |
+'16-CVPR	| 16/03/12	| [Robust scene text recognition with automatic rectification](https://arxiv.org/pdf/1603.03915.pdf) |	0.819	| 0.819 | 0.901	| 0.886 | [`PYTORCH`](https://github.com/marvis/ocr_attention) <br> [`PYTORCH`](https://github.com/WarBean/tps_stn_pytorch) 
+'16-CVPR	| 16/06/27	| [CNN-N-Gram for Handwriting Word Recognition](https://www.cs.tau.ac.il/~wolf/papers/CNNNGram.pdf) |	0.8362 | | | | [`VIDEO`](https://www.youtube.com/watch?v=czc2Ipm3Bis)
+'16-BMVC	| 16/09/19	| [STAR-Net: A SpaTial Attention Residue Network for Scene Text Recognition](http://www.visionlab.cs.hku.hk/publications/wliu_bmvc16.pdf) |	0.836	| 0.833	|	0.899	| 0.891 |
+'17-arXiv	| 17/07/27	| [STN-OCR: A single Neural Network for Text Detection and Text Recognition](https://arxiv.org/pdf/1707.08831.pdf) | 0.798	| 0.86	| |	0.903 | [`*MXNET(M)`](https://github.com/Bartzi/stn-ocr) <br> [`PRJ`](https://bartzi.de/research/stn-ocr) <br> [`BLOG`](https://medium.com/@Synced/stn-ocr-a-single-neural-network-for-text-detection-and-text-recognition-220debe6ded4)
+'17-IJCAI	| 17/08/19	| [Learning to Read Irregular Text with Attention Mechanisms](https://faculty.ist.psu.edu/zzhou/paper/IJCAI17-IrregularText.pdf) |
+'17-arXiv	| 17/09/06	| [Scene Text Recognition with Sliding Convolutional Character Models](https://arxiv.org/pdf/1709.01727.pdf) |	0.765	| 0.816	| 0.845 |	0.852 |
+'17-ICCV	| 17/09/07	| [Focusing Attention: Towards Accurate Text Recognition in Natural Images](https://arxiv.org/pdf/1709.02054.pdf) |	0.859 | 0.874 | 0.942 |	0.933 |
+'18-CVPR	| 17/11/12	| [AON: Towards Arbitrarily-Oriented Text Recognition](https://arxiv.org/pdf/1711.04226.pdf) |	0.828	|0.87	| 0.915	|
+'17-NIPS	| 17/12/04	| [Gated Recurrent Convolution Neural Network for OCR](https://papers.nips.cc/paper/6637-gated-recurrent-convolution-neural-network-for-ocr.pdf)	| 0.815	| 0.808 | 0.978 | | [`*TORCH(M)`](https://github.com/Jianfeng1991/GRCNN-for-OCR)
+'18-AAAI	| 18/01/04	| [Char-Net: A Character-Aware Neural Network for Distorted Scene Text Recognition](http://www.visionlab.cs.hku.hk/publications/wliu_aaai18.pdf) |	0.844	| 0.836	| 0.915 |	0.908 |
+'18-AAAI	| 18/01/04	| [SqueezedText: A Real-time Scene Text Recognition by Binary Convolutional Encoder-decoder Network](https://pdfs.semanticscholar.org/0e59/f7d7e9c9380b425a94038c7a2500b2f6063a.pdf) | |	0.87	| 0.931 |	0.929 |
+'18-CVPR	| 18/05/09  | [Edit Probability for Scene Text Recognition](https://arxiv.org/pdf/1805.03384.pdf) | 0.875 | 0.883 | 0.946 | 0.944 |
+'18-TPAMI	| 18/06/25	| [ASTER: An Attentional Scene Text Recognizer with Flexible Rectification](http://122.205.5.5:8071/UpLoadFiles/Papers/ASTER_PAMI18.pdf) |	0.936	| 0.934 |	0.945	| 0.918	| [`*TF(M)`](https://github.com/bgshih/aster)
+'18-ECCV	| 18/09/08	| [Synthetically Supervised Feature Learning for Scene Text Recognition](http://openaccess.thecvf.com/content_ECCV_2018/papers/Yang_Liu_Synthetically_Supervised_Feature_ECCV_2018_paper.pdf) |	0.871	| 0.894	| 	0.947	| 0.94 |
+'19-AAAI    | 18/09/18  | [Scene Text Recognition from Two-Dimensional Perspective](https://arxiv.org/pdf/1809.06508.pdf)   | 0.821 | 0.92 | | 0.914 |
+<p align='center'>
+ <!---
+<img src = 'https://oss.navercorp.com/CLOVA-AI-OCR/RSC.Papers/blob/master/recognition_svt_results.png' height = '400px'>
+ -->
+<img src = 'https://oss.navercorp.com/CLOVA-AI-OCR/RSC.Papers/blob/master/recognition_ic13_results.png' height = '550px'>
+<img src = 'https://oss.navercorp.com/CLOVA-AI-OCR/RSC.Papers/blob/master/recognition_iiit5k_results.png' height = '550px'>
+</p>
+
+## End-to-End Text Recognition
+* Papers are sorted by published date.
+* IC is shorts for ICDAR.
+* Score is F1-score for generic task. 
+    * (L) stands for score in [leader-board](http://rrc.cvc.uab.es/).
+* `*CODE` means official code and `CODE(M)` means that trained model is provided.
+
+<!---
+*Conf.* | *Date* | *Title* | *IC03* | *IC13* | *IC15* | *Resources* |
+:---: | :---: |:--- | :---: | :---: | :---: | :---: |
+'12-ICPR    | 12/11/11  | [End-to-end text recognition with convolutional neural networks](https://ai.stanford.edu/~ang/papers/ICPR12-TextRecognitionConvNeuralNets.pdf)    | 0.67 | | | [`*CODE`](http://cs.stanford.edu/people/twangcat/ICPR2012_code/SceneTextCNN_demo.tar)
+'14-ECCV    | 14/09/06  | [Deep Features for Text Spotting](https://www.robots.ox.ac.uk/~vgg/publications/2014/Jaderberg14/jaderberg14.pdf) | 0.75  | | | [`PRJ`](http://www.robots.ox.ac.uk/~vgg/research/text/) <br> [`MATLAB`](https://bitbucket.org/jaderberg/eccv2014_textspotting)
+'15-IJCV    | 15/05/07  | [Reading Text in the Wild with Convolutional Neural Networks](https://arxiv.org/pdf/1412.1842.pdf)    | 0.70  | 0.77 | | [`KERAS`](https://github.com/mathDR/reading-text-in-the-wild)
+'15-TPAMI   | 15/10/30  | [Real-time Lexicon-free Scene Text Localization and Recognition](http://cmp.felk.cvut.cz/~neumalu1/Neumann_TPAMI2015.pdf) | | 0.542 | 0.156 |
+'16-arXiv   | 16/04/10  | [TextProposals: a Text-specific Selective Search Algorithm for Word Spotting in the Wild](https://arxiv.org/pdf/1604.02619.pdf) | | 0.6843 | 0.4718 <br> (L)[0.533](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=4&m=7807) | [`*CAFFE(M)`](https://github.com/lluisgomez/TextProposals)
+'17-AAAI    | 16/11/21  | [TextBoxes: A fast text detector with a single deep neural network](https://arxiv.org/pdf/1611.06779.pdf) | | 0.84 | | [`TF`](https://github.com/shinjayne/shinTB) <br> [`*CAFFE(M)`](https://github.com/MhLiao/TextBoxes) <br> [`BLOG_KR`](http://jaynewho.com/post/6)
+'17-ICCV    | 17/07/13  | [Towards End-to-end Text Spotting with Convolution Recurrent Neural Network](https://arxiv.org/pdf/1707.03985.pdf) | | 0.8459 | | [`VIDEO`](https://www.youtube.com/watch?v=j0guWqBJ0lA)
+'17-ICCV    | 17/10/22  | [Deep TextSpotter An End-to-End Trainable Scene Text Localization and Recognition Framework](http://openaccess.thecvf.com/content_ICCV_2017/papers/Busta_Deep_TextSpotter_An_ICCV_2017_paper.pdf) | | 0.77 | 0.47 | [`VIDEO`](https://www.youtube.com/watch?v=VcNSQGO0j7s) <br> [`*CAFFE(M)`](https://github.com/MichalBusta/DeepTextSpotter)
+'18-CVPR    | 18/01/05  | [FOTS: Fast Oriented Text Spotting with a Unified Network](https://arxiv.org/pdf/1801.01671.pdf) | | [0.8477](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=4&m=34627) | [0.6533](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=4&m=34626) | [`VIDEO`](https://www.youtube.com/watch?v=F7TTYlFr2QM)
+'18-TIP     | 18/01/09  | [TextBoxes++: A Single-Shot Oriented Scene Text Detector](https://arxiv.org/pdf/1801.02765.pdf) | | [0.8465](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=4&m=27895) | 0.519 | [`*CAFFE(M)`](https://github.com/MhLiao/TextBoxes_plusplus)
+'18-CVPR    | 18/03/09  | [An end-to-end TextSpotter with Explicit Alighment and Attention](https://arxiv.org/pdf/1803.03474.pdf) | | 0.86 | 0.63 | [`*CAFFE(M)`](https://github.com/tonghe90/textspotter)
+'18-TPAMI   | 18/06/25  | [ASTER: An Attentional Scene Text Recognizer with Flexible Rectification](http://122.205.5.5:8071/UpLoadFiles/Papers/ASTER_PAMI18.pdf) | | | 0.64 | [`*TF(M)`](https://github.com/bgshih/aster)
+'18-ECCV    | 18/07/06  | [Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes](https://arxiv.org/pdf/1807.02242.pdf) | | 0.865 | 0.624 |
+-->
+
+*Conf.* | *Date* | *Title* | *IC03* | *IC13* | *IC15* | *Resources* |
+:---: | :---: |:--- | :---: | :---: | :---: | :---: |
+'12-ICPR    | 12/11/11  | [End-to-end text recognition with convolutional neural networks](https://ai.stanford.edu/~ang/papers/ICPR12-TextRecognitionConvNeuralNets.pdf)    | 0.67 | | | [`*CODE`](http://cs.stanford.edu/people/twangcat/ICPR2012_code/SceneTextCNN_demo.tar)
+'14-ECCV    | 14/09/06  | [Deep Features for Text Spotting](https://www.robots.ox.ac.uk/~vgg/publications/2014/Jaderberg14/jaderberg14.pdf) | 0.75  | | | [`PRJ`](http://www.robots.ox.ac.uk/~vgg/research/text/) <br> [`MATLAB`](https://bitbucket.org/jaderberg/eccv2014_textspotting)
+'15-IJCV    | 15/05/07  | [Reading Text in the Wild with Convolutional Neural Networks](https://arxiv.org/pdf/1412.1842.pdf)    | 0.70  | 0.77 | | [`KERAS`](https://github.com/mathDR/reading-text-in-the-wild)
+'15-TPAMI   | 15/10/30  | [Real-time Lexicon-free Scene Text Localization and Recognition](http://cmp.felk.cvut.cz/~neumalu1/Neumann_TPAMI2015.pdf) | | 0.542 | 0.156 |
+'16-arXiv   | 16/04/10  | [TextProposals: a Text-specific Selective Search Algorithm for Word Spotting in the Wild](https://arxiv.org/pdf/1604.02619.pdf) | | 0.6843 | 0.4718 <br> (L)[0.533](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=4&m=7807) | [`*CAFFE(M)`](https://github.com/lluisgomez/TextProposals)
+'17-AAAI    | 16/11/21  | [TextBoxes: A fast text detector with a single deep neural network](https://arxiv.org/pdf/1611.06779.pdf) | | 0.84 | | [`TF`](https://github.com/shinjayne/shinTB) <br> [`*CAFFE(M)`](https://github.com/MhLiao/TextBoxes) <br> [`BLOG_KR`](http://jaynewho.com/post/6)
+'17-ICCV    | 17/07/13  | [Towards End-to-end Text Spotting with Convolution Recurrent Neural Network](https://arxiv.org/pdf/1707.03985.pdf) | | 0.8459 | | [`VIDEO`](https://www.youtube.com/watch?v=j0guWqBJ0lA)
+'17-ICCV    | 17/10/22  | [Deep TextSpotter An End-to-End Trainable Scene Text Localization and Recognition Framework](http://openaccess.thecvf.com/content_ICCV_2017/papers/Busta_Deep_TextSpotter_An_ICCV_2017_paper.pdf) | | 0.77 | 0.47 | [`VIDEO`](https://www.youtube.com/watch?v=VcNSQGO0j7s) <br> [`*CAFFE(M)`](https://github.com/MichalBusta/DeepTextSpotter)
+'18-CVPR    | 18/01/05  | [FOTS: Fast Oriented Text Spotting with a Unified Network](https://arxiv.org/pdf/1801.01671.pdf) | | [0.8477](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=4&m=34627) | [0.6533](http://rrc.cvc.uab.es/?ch=4&com=evaluation&view=method_info&task=4&m=34626) | [`VIDEO`](https://www.youtube.com/watch?v=F7TTYlFr2QM)
+'18-TIP     | 18/01/09  | [TextBoxes++: A Single-Shot Oriented Scene Text Detector](https://arxiv.org/pdf/1801.02765.pdf) | | [0.8465](http://rrc.cvc.uab.es/?ch=2&com=evaluation&view=method_info&task=4&m=27895) | 0.519 | [`*CAFFE(M)`](https://github.com/MhLiao/TextBoxes_plusplus)
+'18-CVPR    | 18/03/09  | [An end-to-end TextSpotter with Explicit Alighment and Attention](https://arxiv.org/pdf/1803.03474.pdf) | | 0.86 | 0.63 | [`*CAFFE(M)`](https://github.com/tonghe90/textspotter)
+'18-TPAMI   | 18/06/25  | [ASTER: An Attentional Scene Text Recognizer with Flexible Rectification](http://122.205.5.5:8071/UpLoadFiles/Papers/ASTER_PAMI18.pdf) | | | 0.64 | [`*TF(M)`](https://github.com/bgshih/aster)
+'18-ECCV    | 18/07/06  | [Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes](https://arxiv.org/pdf/1807.02242.pdf) | | 0.865 | 0.624 |
+<p align='center'>
+<img src = 'https://oss.navercorp.com/CLOVA-AI-OCR/RSC.Papers/blob/master/end2end_ic13_ic15_results.png' height = '400px'>
+</p>
+
+## Others
+* Papers are sorted by published date.
+* `*CODE` means official code and `CODE(M)` means that trained model is provided.
+
+<!---
+*Conf.* | *Date* | *Title* | *Description* | *Resources* |
+:---: | :---: |:--- | :---: | :---: |
+'14-NIPS	| 14/06/09  |   [Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition](https://arxiv.org/pdf/1406.2227.pdf) |	Dataset | [`PRJ`](http://www.robots.ox.ac.uk/~vgg/data/text/)
+'17-ECCV	| 17/02/13  |	[End-to-End Interpretation of the French Street Name Signs Dataset](https://arxiv.org/pdf/1702.03970.pdf) |	Dataset (FSNS) | [`*TF(M)`](https://github.com/tensorflow/models/tree/master/research/attention_ocr)
+'17-arXiv	| 17/04/11	|	[Attention-based Extraction of Structured Information from Street View Imagery](https://arxiv.org/pdf/1704.03549.pdf) |	FSNS | [`*TF(M)`](https://github.com/tensorflow/models/tree/master/research/attention_ocr) <br> [`TF`](https://github.com/da03/Attention-OCR) <br> [`TF`](https://github.com/emedvedev/attention-ocr) <br> [`LUA`](https://github.com/da03/torch-Attention-OCR) <br> [`BLOG_KR`](https://norman3.github.io/papers/docs/attention_ocr.html)
+'17-CVPR	| 17/07/21	|	[Unambiguous Text Localization and Retrieval for Cluttered Scenes](http://openaccess.thecvf.com/content_cvpr_2017/papers/Rong_Unambiguous_Text_Localization_CVPR_2017_paper.pdf) |	Text Retrieval
+'17-AAAI	| 17/10/22	|	[Detection and Recognition of Text Embedded in Online Images via Neural Context Models](http://s-space.snu.ac.kr/bitstream/10371/116866/1/aaai2017_cameraready.pdf) |	Dataset | [`PRJ`](https://github.com/cmkang/CTSN)
+'18-CVPR	| 17/11/17	|	[Separating Style and Content for Generalized Style Transfer](https://arxiv.org/pdf/1711.06454.pdf) |	Font Style
+'17-arXiv	| 17/12/06	|	[Detecting Curve Text in the Wild New Dataset and New Solution](https://arxiv.org/pdf/1712.02170.pdf) |	Dataset (CTW 1500) | [`PRJ`](https://github.com/Yuliang-Liu/Curve-Text-Detector)
+'18-AAAI	| 17/12/14	|	[SEE: Towards Semi-Supervised End-to-End Scene Text Recognition](https://arxiv.org/pdf/1712.05404.pdf) |	FSNS | [`PRJ`](https://bartzi.de/research/see) <br> [`*CHAINER(M)`](https://github.com/Bartzi/see)
+'17-CVPR	| 18/06/07	|	[Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Networks](https://arxiv.org/pdf/1706.02337.pdf) |	Document Layout | [`PRJ`](http://personal.psu.edu/xuy111/projects/cvpr2017_doc.html)
+'18-CVPR	| 18/06/19	|	[DocUNet: Document Image Unwarping via A Stacked U-Net](http://www.juew.org/publication/DocUNet.pdf) |	Document Dewarping | [`PRJ`](http://www3.cs.stonybrook.edu/~cvl/docunet.html)
+'18-CVPR	| 18/06/19	|	[Document Enhancement using Visibility Detection](http://webee.technion.ac.il/~ayellet/Ps/18-KKT.pdf) |	Document Enhancement | [`PRJ`](http://cgm.technion.ac.il/Computer-Graphics-Multimedia/Software/VisibilityDetection/)
+'18-IJCAI	| 18/06/22	|	[Multi-Task Handwritten Document Layout Analysis](https://arxiv.org/pdf/1806.08852.pdf) |	Document Layout
+'18-ECCV	| 18/07/09	|	[Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes](https://arxiv.org/pdf/1807.03021.pdf) |	Dataset | [`PRJ`](https://github.com/fnzhan/Verisimilar-Image-Synthesis-for-Accurate-Detection-and-Recognition-of-Texts-in-Scenes)
+'19-AAAI	| 18/12/03  |	[EnsNet: Ensconce Text in the Wild](https://arxiv.org/pdf/1812.00723.pdf) | Text Removal |	[`DB`](https://github.com/HCIILAB/Scene-Text-Removal)
+-->
+
+*Conf.* | *Date* | *Title* | *Description* | *Resources* |
+:---: | :---: |:--- | :---: | :---: |
+'14-NIPS	| 14/06/09  |   [Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition](https://arxiv.org/pdf/1406.2227.pdf) |	Dataset | [`PRJ`](http://www.robots.ox.ac.uk/~vgg/data/text/)
+'17-ECCV	| 17/02/13  |	[End-to-End Interpretation of the French Street Name Signs Dataset](https://arxiv.org/pdf/1702.03970.pdf) |	Dataset (FSNS) | [`*TF(M)`](https://github.com/tensorflow/models/tree/master/research/attention_ocr)
+'17-arXiv	| 17/04/11	|	[Attention-based Extraction of Structured Information from Street View Imagery](https://arxiv.org/pdf/1704.03549.pdf) |	FSNS | [`*TF(M)`](https://github.com/tensorflow/models/tree/master/research/attention_ocr) <br> [`TF`](https://github.com/da03/Attention-OCR) <br> [`TF`](https://github.com/emedvedev/attention-ocr) <br> [`LUA`](https://github.com/da03/torch-Attention-OCR) <br> [`BLOG_KR`](https://norman3.github.io/papers/docs/attention_ocr.html)
+'17-CVPR	| 17/07/21	|	[Unambiguous Text Localization and Retrieval for Cluttered Scenes](http://openaccess.thecvf.com/content_cvpr_2017/papers/Rong_Unambiguous_Text_Localization_CVPR_2017_paper.pdf) |	Text Retrieval
+'17-AAAI	| 17/10/22	|	[Detection and Recognition of Text Embedded in Online Images via Neural Context Models](http://s-space.snu.ac.kr/bitstream/10371/116866/1/aaai2017_cameraready.pdf) |	Dataset | [`PRJ`](https://github.com/cmkang/CTSN)
+'18-CVPR	| 17/11/17	|	[Separating Style and Content for Generalized Style Transfer](https://arxiv.org/pdf/1711.06454.pdf) |	Font Style
+'17-arXiv	| 17/12/06	|	[Detecting Curve Text in the Wild New Dataset and New Solution](https://arxiv.org/pdf/1712.02170.pdf) |	Dataset (CTW 1500) | [`PRJ`](https://github.com/Yuliang-Liu/Curve-Text-Detector)
+'18-AAAI	| 17/12/14	|	[SEE: Towards Semi-Supervised End-to-End Scene Text Recognition](https://arxiv.org/pdf/1712.05404.pdf) |	FSNS | [`PRJ`](https://bartzi.de/research/see) <br> [`*CHAINER(M)`](https://github.com/Bartzi/see)
+'17-CVPR	| 18/06/07	|	[Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Networks](https://arxiv.org/pdf/1706.02337.pdf) |	Document Layout | [`PRJ`](http://personal.psu.edu/xuy111/projects/cvpr2017_doc.html)
+'18-CVPR	| 18/06/19	|	[DocUNet: Document Image Unwarping via A Stacked U-Net](http://www.juew.org/publication/DocUNet.pdf) |	Document Dewarping | [`PRJ`](http://www3.cs.stonybrook.edu/~cvl/docunet.html)
+'18-CVPR	| 18/06/19	|	[Document Enhancement using Visibility Detection](http://webee.technion.ac.il/~ayellet/Ps/18-KKT.pdf) |	Document Enhancement | [`PRJ`](http://cgm.technion.ac.il/Computer-Graphics-Multimedia/Software/VisibilityDetection/)
+'18-IJCAI	| 18/06/22	|	[Multi-Task Handwritten Document Layout Analysis](https://arxiv.org/pdf/1806.08852.pdf) |	Document Layout
+'18-ECCV	| 18/07/09	|	[Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes](https://arxiv.org/pdf/1807.03021.pdf) |	Dataset | [`PRJ`](https://github.com/fnzhan/Verisimilar-Image-Synthesis-for-Accurate-Detection-and-Recognition-of-Texts-in-Scenes)
+'19-AAAI	| 18/12/03  |	[EnsNet: Ensconce Text in the Wild](https://arxiv.org/pdf/1812.00723.pdf) | Text Removal |	[`DB`](https://github.com/HCIILAB/Scene-Text-Removal)
+
+## Other lists
+* OCR Paper Curation
+    * [whitelok](https://github.com/whitelok/image-text-localization-recognition)
+    * [tangzhenyu](https://github.com/tangzhenyu/Scene-Text-Understanding)
+    * [wanghaisheng](https://github.com/wanghaisheng/awesome-ocr)
+    * [ChanChiChoi](https://github.com/ChanChiChoi/awesome-ocr/blob/master/README.md)
+    * [handong1587](https://github.com/handong1587/handong1587.github.io/blob/master/_posts/deep_learning/2015-10-09-ocr.md)
+    * [hs105](https://github.com/hs105/Deep-Learning-for-OCR)
+
+## Tutorial Materials
+* Lecture slides
+    * [Deep Neural Networks for Scene Text Reading (IC17 Keynote)](http://u-pat.org/ICDAR2017/keynotes/ICDAR2017_Keynote_Prof_Bai.pdf)
+    * [Oriented Scene Text Detection Revisited (VALSE17 Invited Talk)](http://cloud.eic.hust.edu.cn:8071/~xbai/Talk_slice/Oriented-Scene-Text-Detection-Revisited_VALSE2017.pdf)
+    * [Scene Text Detection and Recognition (Joint course of Megvii Inc. & Peking Univ.)](https://zsc.github.io/megvii-pku-dl-course/slides/Lecture7(Text%20Detection%20and%20Recognition_20171031).pdf)
+    * [Classic Text Detectors](https://www.slideshare.net/anyline_io/text-detection-strategies)
+* Survey Paper
+    * [Scene text detection and recognition: recent advances and future trends](https://www.researchgate.net/profile/Xiang_Bai4/publication/286945604_Scene_text_detection_and_recognition_recent_advances_and_future_trends/links/57f720f408ae886b8981d364/Scene-text-detection-and-recognition-recent-advances-and-future-trends.pdf)
--- a/detection_ic13_results.png
+++ b/detection_ic13_results.png
--- a/detection_ic15_results.png
+++ b/detection_ic15_results.png
--- a/end2end_ic13_ic15_results.png
+++ b/end2end_ic13_ic15_results.png
--- a/overall_histogram.png
+++ b/overall_histogram.png
--- a/overall_pi_chart.png
+++ b/overall_pi_chart.png
--- a/recognition_ic13_results.png
+++ b/recognition_ic13_results.png
--- a/recognition_iiit5k_results.png
+++ b/recognition_iiit5k_results.png
--- a/recognition_svt_results.png
+++ b/recognition_svt_results.png