Pattern Recognition: OCR Text Recognition with Tesseract-OCR in VS2010

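I have recently been working on a project that recognizes text on castings, which requires reading the letters and digits on each casting. I found the open-source recognition library Tesseract, and this post briefly records how to use it.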

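First, download the library and the matching language data from the project home page, http://code.google.com/p/tesseract-ocr/ . I use VS2010, but the lib and include files offered there were compiled with VS2008, so I kept getting errors. VS2010 users can download libraries already built for VS2010 here.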

Then configure the project the same way as for any other library: set up the include directory, the lib directory, and the DLLs.
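#include "allheaders.h"   // Leptonica: PIX, pixRead, pixDestroy
#include "baseapi.h"      // tesseract::TessBaseAPI
#include "strngs.h"       // Tesseract STRING class
#include <cv.h>           // OpenCV headers; not used in this minimal example
#include <highgui.h>
#include <cstdio>
#include <cstdlib>
#include <iostream>
using namespace cv;
using namespace std;

int main(int argc, char* argv[])
{
    const char *image_path = "zj.jpg";

    // Initialize the engine with the English language data.
    // Init returns 0 on success.
    tesseract::TessBaseAPI api;
    if (api.Init(NULL, "eng", tesseract::OEM_DEFAULT) != 0) {
        printf("Could not initialize Tesseract.\n");
        exit(1);
    }
    api.SetPageSegMode(tesseract::PSM_AUTO);

    // Check that the input file exists and can be opened.
    FILE* fin = fopen(image_path, "rb");
    if (fin == NULL) {
        printf("Cannot open input file: %s\n", image_path);
        exit(2);
    }
    fclose(fin);

    // Check that Leptonica can decode the image format.
    PIX *pixs;
    if ((pixs = pixRead(image_path)) == NULL) {
        printf("Unsupported image type.\n");
        exit(3);
    }
    pixDestroy(&pixs);

    // Run recognition on the file and collect the text.
    STRING text_out;
    if (!api.ProcessPages(image_path, NULL, 0, &text_out)) {
        printf("Error during processing.\n");
    }

    cout << "Recognition result: " << text_out.string() << endl;

    api.End();  // release the engine's resources
    return 0;
}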
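Since the goal is to read letters and digits on castings, the raw recognition result may contain whitespace, punctuation, or stray symbols that are not wanted. The following is a minimal sketch of one possible post-processing step, not part of Tesseract itself: `keep_alnum` is a hypothetical helper name introduced here for illustration, and it assumes that only ASCII letters and digits are of interest.

```cpp
#include <cctype>
#include <string>

// Hypothetical helper (not a Tesseract API): keep only ASCII letters and
// digits from the recognized text and drop everything else, such as
// spaces, newlines, and punctuation.
std::string keep_alnum(const std::string& ocr_text)
{
    std::string cleaned;
    cleaned.reserve(ocr_text.size());
    for (std::string::size_type i = 0; i < ocr_text.size(); ++i) {
        // Cast to unsigned char first: passing a negative char value
        // to std::isalnum is undefined behavior.
        const unsigned char ch = static_cast<unsigned char>(ocr_text[i]);
        if (std::isalnum(ch)) {
            cleaned.push_back(ocr_text[i]);
        }
    }
    return cleaned;
}
```

In the program above it could be applied to the result as `keep_alnum(text_out.string())` before printing.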
http://blog.csdn.net/lanbing510/article/details/28696833
Original address: https://www.cnblogs.com/pengkunfan/p/4044186.html