mirror of
https://github.com/CCExtractor/ccextractor.git
synced 2024-12-25 20:32:10 +00:00
Lots of corrections to text.
Added show-specific dictionaries for Grey's Anatomy, Mr. Robot and The Big Bang Theory (Code-in: Deborah Chan)
This commit is contained in:
commit
a1411968b8
34
Dictionary/dict_greys.anatomy.txt
Normal file
34
Dictionary/dict_greys.anatomy.txt
Normal file
@ -0,0 +1,34 @@
|
||||
Grey’s Anatomy
|
||||
Meredith Grey
|
||||
Lexie Grey
|
||||
Ellis Grey
|
||||
Thatcher Grey
|
||||
Derek Shepherd
|
||||
Amelia Shepherd
|
||||
Owen Hunt
|
||||
Maggie Pierce
|
||||
Teddy
|
||||
Dr. Altman
|
||||
Seattle Grace Hospital
|
||||
Mercy West Medical Center
|
||||
Seatle Grace Mercy West Hospital
|
||||
Denny Duquette Memorial Clinic
|
||||
Grey Sloan Memorial Hospital
|
||||
Mayo Clinic
|
||||
Cleveland Clinic
|
||||
Portland General Hospital
|
||||
Seattle Presbyterian Hospital
|
||||
Klausman Institute for Medical Research
|
||||
Roseridge Home for Extended Care
|
||||
Veterans Rehabilitation Center
|
||||
Trauma Center
|
||||
Emergency Room
|
||||
Intensive Care Unit
|
||||
Neonatal Intensive Care Unit
|
||||
Operating Room
|
||||
On-Call Room
|
||||
Chasing Cars
|
||||
Snow Patrol
|
||||
|
||||
|
||||
|
11
Dictionary/dict_mr_robot.txt
Normal file
11
Dictionary/dict_mr_robot.txt
Normal file
@ -0,0 +1,11 @@
|
||||
Mr. Robot
|
||||
Angela Moss
|
||||
Phillip Price
|
||||
Federal Bureau of Investigation
|
||||
Fun Society
|
||||
Dark Army
|
||||
New York
|
||||
Evil Corp Headquarters
|
||||
Allsafe Cybersecurity
|
||||
Ron’s Coffee
|
||||
Python
|
15
Dictionary/dict_the.big.bang.theory.txt
Normal file
15
Dictionary/dict_the.big.bang.theory.txt
Normal file
@ -0,0 +1,15 @@
|
||||
The Big Bang Theory
|
||||
Penny
|
||||
Sheldon’s Spot
|
||||
The Apartment Building
|
||||
Apartment 4A/B
|
||||
The Laundry Room
|
||||
The Roof
|
||||
Wolowitz House
|
||||
Capitol Comics
|
||||
The Cheesecake Factory
|
||||
The Comic Center of Pasadena
|
||||
California Institute of Technology
|
||||
Massachusetts Institute of Technology
|
||||
Jet Propulsion Laboratory
|
||||
Pasadena
|
@ -107,7 +107,7 @@
|
||||
0.73 - GSOC (2014-08-19)
|
||||
------------------------
|
||||
- Added support of BIN format for Teletext
|
||||
- Added start of librarisation. This will allow in the future for other programs to use encoder/decoder functions and more.
|
||||
- Added start of librarization. This will allow in the future for other programs to use encoder/decoder functions and more.
|
||||
|
||||
0.72 - GSOC (2014-08-12)
|
||||
------------------------
|
||||
@ -480,7 +480,7 @@ version of CCExtractor.
|
||||
-----------------
|
||||
- Add -nosync and -fullbin switches for debugging purposes.
|
||||
- Remove -lg (--largegops) switch.
|
||||
- Improve syncronization of captions for source files with
|
||||
- Improve synchronization of captions for source files with
|
||||
jumps in their time information or gaps in the caption
|
||||
information.
|
||||
- [R. Abarca] Changed Mac script, it now compiles/link
|
||||
@ -583,11 +583,11 @@ version of CCExtractor.
|
||||
|
||||
0.45 (2008-11-14)
|
||||
-----------------
|
||||
- Added autodetection of DVR-MS files.
|
||||
- Added auto-detection of DVR-MS files.
|
||||
- Added -asf to force DVR-MS mode.
|
||||
- Added some specific support for DVR-MS
|
||||
files. These format used to work
|
||||
correcty in 0.34 (pure luck) but the
|
||||
correctly in 0.34 (pure luck) but the
|
||||
MPEG code rework broke it. It should
|
||||
work as it used to.
|
||||
- Updated Windows GUI to support the
|
||||
@ -693,7 +693,7 @@ version of CCExtractor.
|
||||
being closed).
|
||||
- Displayed memory is now written to disk at end
|
||||
of stream even if there is no command requesting
|
||||
so (may prevent losing the last screenful).
|
||||
so (may prevent losing the last screen-full).
|
||||
- Important change that could break scripts, but
|
||||
that have been added because old behaviour was
|
||||
annoying to most people: _1 and _2 at the end
|
||||
|
@ -1,12 +1,12 @@
|
||||
Overview
|
||||
========
|
||||
FFmpeg Intigration was done to support multiple encapsulator.
|
||||
FFmpeg Intigration was done to support multiple encapsulations.
|
||||
|
||||
Dependecy
|
||||
=========
|
||||
FFmpeg library's
|
||||
|
||||
Download and Install FFmpeg on your linux pc.
|
||||
Download and Install FFmpeg on your Linux pc.
|
||||
---------------------------------------------
|
||||
|
||||
Download latest source code from following link
|
||||
@ -15,8 +15,8 @@ https://ffmpeg.org/download.html
|
||||
then following command to install ffmpeg
|
||||
./configure && make && make install
|
||||
|
||||
Note:If you installed ffmpeg on non standurd location, please change/update your
|
||||
enviorment variable $PATH and $LD_LIBRARY_PATH
|
||||
Note:If you installed ffmpeg on non-standard location, please change/update your
|
||||
environment variable $PATH and $LD_LIBRARY_PATH
|
||||
|
||||
Download and Install FFmpeg on your Windows pc.
|
||||
----------------------------------------------
|
||||
|
@ -3,7 +3,7 @@ G608
|
||||
G608 (for grid 608) is generated by CCExtractor by using -out=g608.
|
||||
|
||||
This is a verbose format that exports the contents of the 608 grid verbatim
|
||||
so there's no loss of positioning or colors due the limitations or complexy
|
||||
so there's no loss of positioning or colors due the limitations or complexity
|
||||
or other output formats.
|
||||
|
||||
G608 is a text file with a structure based on .srt and looks like this:
|
||||
|
10
docs/OCR.txt
10
docs/OCR.txt
@ -1,14 +1,14 @@
|
||||
|
||||
Overview
|
||||
========
|
||||
OCR (Optical Character Recognisation ) is an technique used to
|
||||
extract text from images. In the World of Subtile, subtitle stored
|
||||
in bitmap format are common and even neccassary. for converting subtile
|
||||
in bitmap format to subtilte in text format ocr is used.
|
||||
OCR (Optical Character Recognition) is a technique used to
|
||||
extract text from images. In the World of Subtitle, subtitle stored
|
||||
in bitmap format are common and even necessary for converting subtitle
|
||||
in bitmap format to subtitle in text format ocr is used.
|
||||
|
||||
Dependency
|
||||
==========
|
||||
Tesseract (OCR library by google)
|
||||
Tesseract (OCR library by Google)
|
||||
Leptonica (image processing library)
|
||||
|
||||
How to compile ccextractor on linux with OCR
|
||||
|
@ -1,4 +1,4 @@
|
||||
For building ccextractor using cmake folllow below steps..
|
||||
For building ccextractor using cmake follow steps below..
|
||||
|
||||
Step 1) Check you have right version of cmake installed. ( version >= 3.0.2 )
|
||||
We are using CMP0037 policy of cmake which was introduced in 3.0.0
|
||||
@ -6,12 +6,12 @@ Step 1) Check you have right version of cmake installed. ( version >= 3.0.2 )
|
||||
suggest to use 3.0.2 or higher version.
|
||||
|
||||
|
||||
Step 2) create a seprate directory where you want to build the target.
|
||||
In Unix you can do it using follwing commands.
|
||||
Step 2) create a separate directory where you want to build the target.
|
||||
In Unix you can do it using following commands.
|
||||
~> cd ccextractor
|
||||
~> mkdir build
|
||||
|
||||
Step 3) make the build sytem using cmake
|
||||
Step 3) make the build system using cmake
|
||||
~> cmake ../src/
|
||||
|
||||
Step 4) Compile the code.
|
||||
|
Loading…
Reference in New Issue
Block a user