Lots of corrections to text.

Added show-specific dictionaries for Grey's Anatomy, Mr. Robot and The Big Bang Theory (Code-in: Deborah Chan)
Carlos Fernandez 2016-11-29 10:24:16 -08:00
commit a1411968b8
10 changed files with 88 additions and 28 deletions

@@ -0,0 +1,34 @@
Greys Anatomy
Meredith Grey
Lexie Grey
Ellis Grey
Thatcher Grey
Derek Shepherd
Amelia Shepherd
Owen Hunt
Maggie Pierce
Teddy
Dr. Altman
Seattle Grace Hospital
Mercy West Medical Center
Seatle Grace Mercy West Hospital
Denny Duquette Memorial Clinic
Grey Sloan Memorial Hospital
Mayo Clinic
Cleveland Clinic
Portland General Hospital
Seattle Presbyterian Hospital
Klausman Institute for Medical Research
Roseridge Home for Extended Care
Veterans Rehabilitation Center
Trauma Center
Emergency Room
Intensive Care Unit
Neonatal Intensive Care Unit
Operating Room
On-Call Room
Chasing Cars
Snow Patrol

@@ -0,0 +1,11 @@
Mr. Robot
Angela Moss
Phillip Price
Federal Bureau of Investigation
Fun Society
Dark Army
New York
Evil Corp Headquarters
Allsafe Cybersecurity
Rons Coffee
Python

@@ -0,0 +1,15 @@
The Big Bang Theory
Penny
Sheldons Spot
The Apartment Building
Apartment 4A/B
The Laundry Room
The Roof
Wolowitz House
Capitol Comics
The Cheesecake Factory
The Comic Center of Pasadena
California Institute of Technology
Massachusetts Institute of Technology
Jet Propulsion Laboratory
Pasadena

@@ -107,7 +107,7 @@
0.73 - GSOC (2014-08-19)
------------------------
- Added support of BIN format for Teletext
-- Added start of librarisation. This will allow in the future for other programs to use encoder/decoder functions and more.
+- Added start of librarization. This will allow in the future for other programs to use encoder/decoder functions and more.
0.72 - GSOC (2014-08-12)
------------------------
@@ -480,7 +480,7 @@ version of CCExtractor.
-----------------
- Add -nosync and -fullbin switches for debugging purposes.
- Remove -lg (--largegops) switch.
-- Improve syncronization of captions for source files with
+- Improve synchronization of captions for source files with
jumps in their time information or gaps in the caption
information.
- [R. Abarca] Changed Mac script, it now compiles/link
@@ -583,11 +583,11 @@ version of CCExtractor.
0.45 (2008-11-14)
-----------------
-- Added autodetection of DVR-MS files.
+- Added auto-detection of DVR-MS files.
- Added -asf to force DVR-MS mode.
- Added some specific support for DVR-MS
files. These format used to work
-correcty in 0.34 (pure luck) but the
+correctly in 0.34 (pure luck) but the
MPEG code rework broke it. It should
work as it used to.
- Updated Windows GUI to support the
@@ -693,7 +693,7 @@ version of CCExtractor.
being closed).
- Displayed memory is now written to disk at end
of stream even if there is no command requesting
-so (may prevent losing the last screenful).
+so (may prevent losing the last screen-full).
- Important change that could break scripts, but
that have been added because old behaviour was
annoying to most people: _1 and _2 at the end

@@ -1,12 +1,12 @@
Overview
========
-FFmpeg Intigration was done to support multiple encapsulator.
+FFmpeg Intigration was done to support multiple encapsulations.
Dependecy
=========
FFmpeg library's
-Download and Install FFmpeg on your linux pc.
+Download and Install FFmpeg on your Linux pc.
---------------------------------------------
Download latest source code from following link
@@ -15,8 +15,8 @@ https://ffmpeg.org/download.html
then following command to install ffmpeg
./configure && make && make install
-Note:If you installed ffmpeg on non standurd location, please change/update your
-enviorment variable $PATH and $LD_LIBRARY_PATH
+Note:If you installed ffmpeg on non-standard location, please change/update your
+environment variable $PATH and $LD_LIBRARY_PATH
Download and Install FFmpeg on your Windows pc.
----------------------------------------------

@@ -3,7 +3,7 @@ G608
G608 (for grid 608) is generated by CCExtractor by using -out=g608.
This is a verbose format that exports the contents of the 608 grid verbatim
-so there's no loss of positioning or colors due the limitations or complexy
+so there's no loss of positioning or colors due the limitations or complexity
or other output formats.
G608 is a text file with a structure based on .srt and looks like this:

@@ -1,14 +1,14 @@
Overview
========
-OCR (Optical Character Recognisation ) is an technique used to
-extract text from images. In the World of Subtile, subtitle stored
-in bitmap format are common and even neccassary. for converting subtile
-in bitmap format to subtilte in text format ocr is used.
+OCR (Optical Character Recognition) is a technique used to
+extract text from images. In the World of Subtitle, subtitle stored
+in bitmap format are common and even necessary for converting subtitle
+in bitmap format to subtitle in text format ocr is used.
Dependency
==========
-Tesseract (OCR library by google)
+Tesseract (OCR library by Google)
Leptonica (image processing library)
How to compile ccextractor on linux with OCR

@@ -1,4 +1,4 @@
-For building ccextractor using cmake folllow below steps..
+For building ccextractor using cmake follow steps below..
Step 1) Check you have right version of cmake installed. ( version >= 3.0.2 )
We are using CMP0037 policy of cmake which was introduced in 3.0.0
@@ -6,12 +6,12 @@ Step 1) Check you have right version of cmake installed. ( version >= 3.0.2 )
suggest to use 3.0.2 or higher version.
-Step 2) create a seprate directory where you want to build the target.
-In Unix you can do it using follwing commands.
+Step 2) create a separate directory where you want to build the target.
+In Unix you can do it using following commands.
~> cd ccextractor
~> mkdir build
-Step 3) make the build sytem using cmake
+Step 3) make the build system using cmake
~> cmake ../src/
Step 4) Compile the code.