ccextractor/docs
Anshul Maheshwari 0b76cc1991 Add more tapping points while debugging tesseract
When OCR_DEBUG is defined, code will dump more images
to find root cause of failed OCR
2018-12-22 15:12:46 +05:30
..
img Added Visual Studio compilation guide (#1029) 2018-10-28 20:26:44 -07:00
708_STATUS.TXT changes 2015-08-19 20:25:07 +03:00
AUTHORS.TXT Version push to 0.87 2018-01-11 12:42:18 -08:00
BINARY_FILE_FORMAT.TXT Initial git. 2014-04-12 12:41:27 +02:00
ccextractor.cnf.sample Modify git-ignore 2018-01-22 21:28:22 +05:30
CHANGES.TXT Add more tapping points while debugging tesseract 2018-12-22 15:12:46 +05:30
CODE_ORGANIZATION.TXT Initial git. 2014-04-12 12:41:27 +02:00
COMPILATION.MD Added Visual Studio compilation guide (#1029) 2018-10-28 20:26:44 -07:00
FFMPEG.TXT Update FFMPEG.TXT 2016-12-02 11:02:24 +05:30
freetype.TXT Add FreeType (#876) 2018-01-09 10:24:06 -08:00
FRONTEND_COMMUNICATIONS.TXT Update FRONTEND_COMMUNICATIONS.TXT 2016-12-02 11:04:09 +05:30
G608.TXT Update G608.TXT 2016-12-02 13:33:36 +05:30
gpl-2.0.txt Added GPL 2 to the document directory 2016-01-23 09:14:26 +01:00
guidoc.md Add documentation for GUI (#796) 2017-10-13 14:42:03 -07:00
HARDSUBX.txt Update instructions to install with HardSubx 2018-05-15 12:39:23 +05:30
MAILINGLIST.TXT Updated mailing list (changed sourceforge for google groups) 2018-02-21 20:21:24 +00:00
OCR.md Correct compilation guide for ocr 2018-10-28 13:27:45 +05:30
Python extension module.txt update python extension doc 2018-01-24 16:00:50 +08:00
raspberrypi.md Add compilation section to readme and improve instructions. (#836) 2017-12-13 13:01:10 -08:00
README.TXT Version push to 0.87 2018-01-11 12:42:18 -08:00
Updating_Dependencies.txt [IMPROVEMENT] Update libGPAC (#974) 2018-04-23 11:24:03 -07:00
using_cmake_build.txt Correct typographical error (#984) 2018-05-31 09:46:00 -07:00

CCExtractor
(check AUTHORS.TXT for history and developers)
----------------------------------------------

License
-------
GPL 2.0. 


Description
-----------
Since the original port, the whole code has been rewritten (more than once,
one might add) and support for most subtitle formats around the world has
been added (teletext, DVB, CEA-708, ISDB...)

Basic Usage 
-----------
(please run ccextractor with no parameters for the complete manual -
this is for your convenience, really).

ccextractor reads a video stream looking for closed captions (subtitles).
It can do two things:

- Save the data to a "raw", unprocessed file which you can later use
  as input for other tools, such as McPoodle's excellent suite. 
- Generate a subtitles file (.srt,.smi, or .txt) which you can directly 
  use with your favourite player.

Running ccextractor without parameters shows the help screen. Usage is 
trivial - you just need to pass the input file and (optionally) some
details about the input and output files.


Languages
---------
Usually English captions are transmitted in line 21 field 1 data,
using channel 1, so the default values are correct so you don't
need to do anything and you don't need to understand what it all
means.

If you want the Spanish captions, you may need to play a bit with
the parameters. From what I've been, Spanish captions are usually
sent in field 2, and sometimes in channel 2. 

So try adding these parameter combinations to your other parameters.

-2 
-cc2
-2 -cc2

If there are Spanish subtitles, one of them should work. 

McPoodle's page
---------------
http://www.theneitherworld.com/mcpoodle/SCC_TOOLS/DOCS/SCC_TOOLS.HTML

Essential CC related information and free (with source) tools.

Encoding
--------
This version, in both its Linux and Windows builds generates by
default Unicode files. You can use -latin1 and -utf8 if you prefer 
these encodings (usually it just depends on what your specific
player likes).

Future work
-----------
- Please check www.ccextractor.org for news and future work.