Commit Graph

1696 Commits

Author SHA1 Message Date
Null
84a9ea5572 Fix OCR issue caused by separated dvb subtitle regions (#857) 2017-12-28 09:19:04 +01:00
Alex Huang
5e8a5590ce reworks scanning newlines to look for content in a line (#858) 2017-12-28 08:33:59 +01:00
Null
fb55d6d6d3 [Fix] Put OCR specific code inside ifdef (#855)
* Fix failing travis build.

Removed debug code. No idea why it causes travis fails to build

* Fix debug code
2017-12-27 07:33:16 +01:00
Null
b0afb983c9 [FIX] Fix a crash while processing DVB subtitles (#850)
* Add more debug info

* Fix crash in dvb process
2017-12-26 10:08:26 +01:00
Null
e56bab67b8 [Fix] Fix DVB bug (Multiple-line subtitle; Missing last line) (#844)
* multiple line & trying to fix the missing last line

* Fix format; move code into loop

* Revert some format changes
2017-12-26 06:14:11 +01:00
Chuck Wilson
59b8f81283 [FEATURE] Support for Source-Specific Multicast (#802)
* Support for Source-Specific Multicast

* fixing whitespace issues

* updating changelog
2017-12-24 06:10:14 +01:00
Null
31c39eea55 [FIX] Fix crash when image passed into OCR is empty (#841)
* Fix crash when image passed into OCR is empty

* Avoid OCR
2017-12-23 01:20:41 +01:00
Manish Mahalwal
f9a0874e58 Fixed -sentencecap for teletext samples (#842) 2017-12-23 01:20:16 +01:00
Anshul Maheshwari
1858425944 Merge pull request #822 from MatejMecka/master
[IMPROVEMENT] Upgrade UTF8proc
2017-12-16 15:43:59 +05:30
Null
4263a341e1 Turn on optimization 2017-12-16 10:41:40 +08:00
Null
d70c81d1d1 Make CMake work with Windows 2017-12-12 17:45:04 +08:00
Carlos Fernandez Sanz
fe17cddec8 Fixed 8 warnings 2017-12-07 13:17:09 -08:00
Carlos Fernandez
33c4c5a021 Import teletext.h from lib_ccx.h for function definitions that are used from general_loop 2017-12-07 12:34:15 -08:00
Carlos Fernandez
04ea39073b Moved some declarations from telxcc.c to teletext.h - needed to avoid some warnings about implicit declarations.
gzguts.h: Moved UNIX-only include to #ifdef to prevent build errors on Windows.
2017-12-07 12:29:30 -08:00
Carlos Fernandez Sanz
d7aa1f1bf4 - Fix some warnings (wrong parameter type, implicit function declarations...) 2017-12-07 12:14:50 -08:00
Carlos Fernandez
d50e05315f The #ifdef PYTHON_API crap was preventing the write context to be deinitialized - meaning files not being closed, semaphores not deleted... 2017-12-06 18:25:22 -08:00
Null
3fb5bab343 Fix delay in DVB (#826) 2017-12-06 07:53:21 -08:00
Anshul Maheshwari
4bcd1edff5 Merge pull request #825 from anshul1912/master
Add MXF support
2017-12-06 14:14:46 +05:30
Anshul Maheshwari
42ab16405d Add MXF support 2017-12-06 11:36:41 +05:30
Null
c78db1dd24 Fix delay option (#824) 2017-12-05 08:45:47 -08:00
MatejMecka
76dc969363 Upgrade UTF8proc 2017-12-04 21:41:53 +01:00
Null
241e2f5e14 Upgrade win_iconv (#815) 2017-12-02 20:52:29 -08:00
Matej Plavevski
afdf4e74be Upgrade zlib to 1.2.11 (#814) 2017-12-02 20:49:41 -08:00
govindbalaji-s
a9180719b6 Made fatal error messages clearer (#811) 2017-12-01 07:38:41 -08:00
Matej Plavevski
88844fea42 Update LibPNG from 1.6.27 to 1.6.34 (#809) 2017-11-30 12:24:01 -08:00
HemangRajvanshy
f5700d5304 Making error messages clearer and less ambiguous. (#808) 2017-11-30 08:40:53 -08:00
Carlos Fernandez
5db65fa3a1 #ifdef ENABLE_PYTHON in ccx_encoders_python.c. 2017-11-07 12:38:25 -08:00
AlexBratosin2001
bdb8221213 Replace incorrect memset (#800) 2017-10-23 19:38:07 -07:00
Evgeny Shulgin
a6f0a07bf9 Merge branch 'master' into something 2017-10-18 13:22:23 -07:00
Evgeny Shulgin
1816894ccf Temporarily wrapped the Python API 2017-10-18 23:15:54 +03:00
Carlos Fernandez
07f289d1e0 Added missing function prototype.
Added ccextractor.h to solution.
2017-10-18 12:55:46 -07:00
Vitor Massaru Iha
824dfeb166 [IMPROVEMENT] function headers in "get_more_data" functions were standardized. (#786)
* function headers in "get_more_data" functions were standardized.

* Unnecessary stream_mode check inside the while loop was removed.

* terminate_asap if condition was moved to while condition.

* Unnecessary condition was removed.
2017-10-13 14:53:37 -07:00
Vitor Massaru Iha
6aa90fc091 [gpacmp4/avilib.c]: Fix redefinition of VERSION and PACKAGE (#790)
compilation warnings before:

../src/gpacmp4/avilib.c:35:0: warning: "PACKAGE" redefined
 #define PACKAGE "GPAC/avilib"

<command-line>:0:0: note: this is the location of the previous definition
../src/gpacmp4/avilib.c:36:0: warning: "VERSION" redefined
 #define VERSION GPAC_FULL_VERSION
2017-10-13 14:47:27 -07:00
Saksham Gupta
5641c3116e Fixes Undeclared-Variable Warnings in extractor.c (#795)
Fixes https://github.com/CCExtractor/ccextractor/issues/780

Signed-off-by: Saksham Gupta <shucon01@gmail.com>
2017-10-13 14:42:29 -07:00
Mayank Gupta
fc3f505189 Add GUI for CCExtractor (GSoC 2017) (#794) 2017-10-11 14:11:39 -07:00
Saurabh Shrivastava
0d9872021d ¯\_(ツ)_/¯ Fix typo in name of Zlib directory in cmakefile. (#784) 2017-10-03 12:30:10 -07:00
Saurabh Shrivastava
30443a5b9a Use proper newlines while printing SRT from bitmaps. (#783)
Probably fixes #767 .
2017-10-03 08:24:34 -07:00
Saurabh Shrivastava
2eb5fd26de [FIX] Move files into appropriate directories & fix build scripts. (#781)
* Move wrappers and extractors inside src/ and update CMakeLists.

* Reflect change in path across build scripts.

* Remove redundant source file inclusion.

* Always use supplied libpng.
2017-10-02 12:16:04 -07:00
Hugh Mackworth
01852ef055 Compilation on the Mac (#777)
* Update README.md

* Delete README.MAC.TXT

No longer accurate given work done to integrate Mac into build processes.

* Change to use project's PNG/ZLIB libraries

* Fix Mac build command
Makes OCR an optional parameter
Adds python API file to build

* Update README.md
2017-10-02 11:59:00 -07:00
Vinícius Lugão
f8d9e042bb Fix to output CC data when -out=raw is used (#775)
When the -out=raw option is used, the ccextractor jumped to spupng output
format, generating broken files in spupng format without CC data.
With this fix, now it generates CC data in McPoodle's Broadcast format.
2017-09-08 10:06:00 -07:00
Diptanshu Jamgade
47c5a6e73b Cleaning up the codebase and additional changes in Python SRT generator. (#771)
* Removed all extractors except the grid extractor.
Removed the call to transcript extractor in ccx_encoders_transcript.c

* Removed unnecessary array appending statements in python_grid_extractor.
WIP: switch in extractor.

* Added switch in g608 grid extractor.

* Deleted comments from wrappers.

* Refactored code in ccextractor.c and .h files.
Removed all the commented part.
Made proper changes according to the coding conventions.

* Removed calls to extractor from all the encoders.
The only call made to extractor is from ccx_encoders_python.c.

* Removed a comment from wrapper.c.
In init_write function of output.c added a call to free the output string returned by asprintf in case of
sending filename to callback function.

* Added calls to free the char* which is malloced by asprintf in
extractor.c
WIP: Free the global variable elements.

* Sample testing correctly for italics tag.
Also added a hack to print only 32 characters when unicode fails.
WIP: Font tag.

* Added support for handling font and italics in Python SRT generator.

* modified the font generator.
Also, added count method for checking blank strings in
python_srt_generator.

* Added free statements for avoiding memory leaks.

* added return code for failure of asprintf calls.

* Removing unnecessary code from api_testing.py

* Made modifications to Makefile and build script.

* Added recursive_tester.py
Autoconf builds successfully.

* BUG: Made change to get_line_encoded to encode the last \0 character in a
line. Otherwise the EOL character is absent, causing garbage values to be
present in SRT.

* Exporting the encoding of the captions from CCExtractor to Python so
that the python SRT generator can generate proper SRT files.

* Modified the include statement in extractor.h
2017-08-25 11:03:00 -07:00
Saurabh Shrivastava
d19f471352 Correctly handle return codes. (#763)
Return codes after parameter parsing were incorrectly handled, leading to errors such as `Error: Invalid option to CCextractor Library`.
2017-08-21 14:11:19 -07:00
Saurabh Shrivastava
4fe82abbfc Get commit hash and compilation date when built using cmake. (#764)
Who knew I would have to read so much documentation for such trivial task 😒
2017-08-20 08:55:09 -07:00
Diptanshu Jamgade
21eaa3de04 Python bindings with extraction of CE608 grid and writing to a SRT output. (#768)
* added python_extract to encoders_srt and the captions are being
extracted in needed format. Search for an alternative to asprintf

* Checking if the alternative to asprintf generate proper srts

* CC captions accessible via python script

* Removing python caption code from __wrap_write function

* removing old cc_to_python functions

* Removing python_subs structure and all the changes done for that struct

* Removing filename functions from ccextractor.*

* Renaming make_message to time_wrapper

* Applying to python_extract codebase: SSA format

* Added python_extract_time_based and done validation for ssa

* Applying python_extract_time_based: Done validation for srt and webvtt

* led attempt for SAMI support of python_extract. Code is commented

* Applying python_extract_time_based: validate support for SMPTETT

* Added python_extract_transcript and made changes for time printing.

* added show_extracted_captions_wtih_timings function

* Added show_extracted_captions_with_timings to python script for testing
purpose.

* refactored extractors to api directory. commented out show captions in main()

* build and build library working for the extractors.

* made caption generator work with a 0.1 time sleep. Start refactoring

* added asprintf for windows.

* file being written in the running directory

* Auto -deletion of python temporary file

* Python captions printing status set to proper.

* termination of tail successful

* Writing successful for the sample

* Generating unalternating output

* adding api_support.py

* Adding bld_flags in build_api

* Added  to build_library

* Auto deletion of temporary file on SIGINT

* Discussing Seg fault with Izaron

* working for python and linux with samples. testing -out=pythonapi with stream

* Done adding bitmap support

* added -out=pythonapi support for bitmap

* Setting the messages_target to 0 for output = pythonapi

* Added wrapper for setting -out=pythonapi. Checking if -stdout value can be used in python.

* adding the cc_to_stdout=1 value for -out=pythonapi. Thus generation of output file has been avoided. May be needed to change in future.

* added extractor for g608 grid. removed sami extractor. need to work on overlap of -out=pythonapi and -out=g608

* Removed overlap of -out=pythonapi by adding -pythonapi and
signal_python_api global variable.

* added support for separate c608 grid catching. Need to test the output
via python.

* added support for separate printing of text font and color in CE608.
Need to make sure that the function is inbuilt.

* ADDED ce608 GRID SUPPORT FROM PYTHON
need to discuss whether to keep the print_cc_grid function specific to
the module or make it user accessible.
Mostly it would be better to make it user accessible.

* made changes in the call_from_python_api function such that only
api_options is needed to be passed.
An if statement before the call to g608_extractor has also been added.
Waiting for Carlos to comment on the output generated till this stage.

* added a signal_python_api check before calling every write function.
Thus basic writing output can be avoided.

* Commented all calls to python_extract_time_based.
making changes to python_extract_g608 to be called only from the point
when a g608 caption is detected.

* Added pass_cc_buffer_to_python in encoders_common.c temporarily
redefined get_*_encoded from static to normal
included the above functions in encoders_common.h

* Added if-else statement for switch in encode_sub function.
This is done mainly for making sure no output is generated in the api
call.

* Added ccx_encoders_python.c
Defined pass_cc_buffer_to_python in ccx_encoders_python.c
added if else statement in encode_sub's switch to make sure that the output is not generated in case of -pythonapi call

* Removed __wrap_write from the entire code base.
It's declaration and definition are only present in CCExtractor.*

* Commented out the /dev/null part in ccx_encoders_common.c.
Proceeding further on checking for file generation.

* Added output_filename in array global variable and is generated in
init_write function.
included ccextractor.h in output.c to access global variable
signal_python_api for avoiding output generation in init_write and
invalid free in dinit_write.

* Modified the definition of init_write function for accessing
signal_python_api.

* Deleted the commented part of /dev/null in ccx_encoders_common.c.

* Added target_message=0 in -pythonapi param parsing in param.c to avoid
the API from printing to STDOUT.
Deleted the commented part of -out=pythonapi.
Thinking of adding a different param for silencing the output when the
call is made from python api.

* Removed __wrap_write from ccextractor.c and ccextractor.h.

* Added ccx_to_python_g608 and modified api_support.py file.
added documentation in ccextractor.c.

* added the generate srt script. However, some random characters are
coming in first line. Need to talk about this.

* Added SRT generator for python.
Using string to remove the garbage value.
Add code for srt counter and also the start_time and end_time
conversion.

* removed the trash characters and added code to print the timings.
However, the last blank frame also results in a print. Need to take care
of this.

* rectified the mistake of writing only timings and not captions.
now next step is to just make the timings print properly

* some minor changes before diving into extracting srt_counter from the made codebase

* Added extraction of srt_counter in python_extract via fflush
srt_counter-value.
Need to modify the processing in python.

* Added the entire method to extract captions and generate srt files. Next step would be to define a concise function for writing the srt

* Processing into a srt working properly.
Next step is to add the information of font into the caption text.

* the data is getting generated for proper SRT counters.

* A turning point to the approach.
Added END OF FRAME line for printing the data for every particular
srt_counter.
Proceeding further with the generation of srt by data manipulation.

* some minor bugs but the output srt is being generated correctly. However, The font and colour encoding needs to be done.

* Taken care of random characters. Need to discuss this with Carlos. Moving further to font/color processing.

* Taken care of random characters. Need to discuss this with Carlos. Moving further to font/color processing.

* Added fflush and cleaned up the python code of srt generation

* Added <i> tag for italics.
Proceeding further with other types.

* Added the code to check for underline.
However, need to check how CCExtractor generates srt when both italics
and underline are present. For now a new line is added if both are
present.

* Shifting for making changes in the I/O work.

* Stable output for samples with italics is being generated.

* Added the PYTHONAPI macro definition and testing for its existence in the set_python_api function.

* build script for linux is working correctly.
Build_library is showing error of invalid def of set_pythonapi.
Moreover, extractor has some memory seg fault.

* Added mod to set a MACRO as my_python_api to set the callback function.
Till now all calls to the reporter are commented.
Working on getting the reporter to print the lines.

* Changes have been implemented to bring reporter in working state.
For now a constant string is passed from extractor. Need to make the
proper parsing possible.

* Changed the code in extractor such that entire grid is returned to the
callback function.
Need to provide this grid to the write function and also cleanup the
codebase.

* Writing the outputted srt in a file called "temp.srt".
Need to modify init_write to push filename that is to be created in
python using callback.

* Added code to get start and end time simultaneously.
entire SRT is getting generated.

* removed ccx_python_encoders.c

* Compiling and executing on Windows

* Moved definitions get_line_encoded, get_color_encoded, get_font_encoded from ccx_encoders_g608.c to ccx_encoders_common.c.
Also, deleted the static definition of get_font_encoded from
ccx_encoders_webvtt.c

* added a write statement in write_cc_bitmap_as_srt

* Rectified transfer of get_line_encoded, get_color_encoded and
get_font_encoded from ccx_decoders_common.c to ccx_encoders_common.c.
2017-08-20 08:54:35 -07:00
Evgeny Shulgin
14e0d86df8 Added gui mode reports for Matroska decoder 2017-08-20 15:08:20 +03:00
Saurabh Shrivastava
da0893fdb3 Fix CMakeLists for MacOS and Linux.
With #742 and this, CCExtractor could be build across all three platforms using CMake.
2017-07-26 04:23:48 +05:30
Carlos Fernandez
ce2b680a43 Merge branch 'pr/n759_Abhinav95' 2017-07-21 11:25:24 -07:00
Abhinav95
b1cc95d972 Adding grayscale conversion for better OCR 2017-07-21 12:12:50 +05:30
Carlos Fernandez
536082ae6e Merge branch 'pr/n751_Diptanshu8' 2017-07-19 10:59:56 -07:00
Diptanshu8
3f069b84c9 fixed -out=dvdraw sample error. 2017-07-18 04:48:08 +00:00
Carlos Fernandez
ddca8001cc Merge branch 'pr/n755_Abhinav95' 2017-07-17 11:44:11 -07:00
Diptanshu8
02b4427260 making changes to write wrapper 2017-07-17 08:59:00 +00:00
Abhinav95
ec5618dd1f Fixing end timestamp in DVB transcripts + spelling/readme improvements 2017-07-17 04:23:34 +05:30
Carlos Fernandez
e8f742a627 Corrected function prototype 2017-07-14 13:01:39 -07:00
Saurabh Shrivastava
45946e3ac9 Initialise timing for MP4 webvtt.
Fixes #753 .
2017-07-14 18:59:02 +05:30
Diptanshu8
e3e5f8b36e Apply write wrapper across entire database. 2017-07-13 07:26:49 +00:00
Diptanshu8
1435411861 Commenting out the file name related functions. 2017-07-13 05:48:14 +00:00
Diptanshu8
86b7e7348e Added extension to python_subs 2017-07-11 21:34:05 +00:00
Diptanshu8
d2bd2d1397 added basefilename to python_subs 2017-07-11 21:21:18 +00:00
Diptanshu8
57424857b0 Working on PR 2017-07-11 21:21:18 +00:00
Diptanshu8
2ced408994 build and build_library working correctly 2017-07-11 21:21:18 +00:00
Diptanshu8
976f01cee1 CCs to python_subs extracted properly 2017-07-11 21:17:46 +00:00
Diptanshu8
4d5f80a01d Found wrapper for write. Check file_handle and start processing. 2017-07-11 21:17:46 +00:00
Carlos Fernandez
0327e676dd Merge branch 'pr/n747_Diptanshu8' 2017-07-11 11:40:10 -07:00
Diptanshu8
fe6813736c segregating the code and changing myarguments and argument_count. Also, gsoc directory has been created. 2017-07-06 22:58:23 +00:00
Diptanshu8
dc35af0bc0 Modifications to the code. 2017-07-06 22:22:59 +00:00
Carlos Fernandez
0c0bf1aafd -Added -nospupngocr (don't OCR bitmaps when generating spupng, faster) 2017-07-06 13:37:20 -07:00
Carlos Fernandez
62dab0dde9 Merge branch 'pr/n746_Abhinav95' 2017-07-06 12:59:26 -07:00
=
31a2d46996 Forcing -noru to cause deduplication in ISDB 2017-07-07 01:22:11 +05:30
Carlos Fernandez
710a205f99 Add support for file split on keyframe (-segmentonkeyonly)
Segmenting now doesn't destroy the whole encoding context, just closes and reopens the output file
Correct a wrong function prototype for process_hex()
OCR: Attempt to correctly deal with TessBaseAPIRecognize returning an error
Changed output for parse PMT to CCX_DMT_PMT instead of CCX_DMT_VERBOSE
2017-07-06 11:57:17 -07:00
Diptanshu8
6e50104da4 Cyclic rotation and python script argv passing solved 2017-06-28 21:35:32 +00:00
Diptanshu8
edb2431cf9 Cyclic rotation patch 2017-06-28 19:07:44 +00:00
Diptanshu8
67204d8e3c modifying the return value from main 2017-06-28 07:57:44 +00:00
Diptanshu8
a0047a9d3e changing the return status to EXIT_ON 2017-06-28 07:01:32 +00:00
Diptanshu8
cf4aa9021d Modified build_library script for generating the python module 2017-06-27 10:00:47 +00:00
Diptanshu8
3e99dc2955 Reflecting changes of library source code in the ccextractor source code 2017-06-27 10:00:47 +00:00
Diptanshu8
751a22fe68 Resolving merge conflicts in .gitignore 2017-06-27 10:00:47 +00:00
Diptanshu8
dba1d7b6eb Debugging the core dumped error 2017-06-27 10:00:07 +00:00
Diptanshu8
f70f34e009 pushing the api code 2017-06-27 10:00:07 +00:00
Diptanshu8
644d26546c Facing VERSION attribute error in generated module 2017-06-27 10:00:07 +00:00
Diptanshu8
b0a0c92e50 library development done till api_start. Implementation of stop and status function is left. 2017-06-27 10:00:07 +00:00
diptanshuj@gmail.com
d3540ccc0a Made temp changes to initiate library coding 2017-06-27 10:00:07 +00:00
Carlos Fernandez
735f4392dd Merge branch 'pr/n742_saurabhshri' 2017-06-05 13:22:21 -07:00
Carlos Fernandez
399f59981d Added --analyzevideo to help page. 2017-06-05 11:00:03 -07:00
Saurabh Shrivastava
51d936bc90 Fix CMake build for windows.
Thank you linker flags for eating my 3+ hours.
2017-06-03 22:32:15 +05:30
Carlos Fernandez
d9796410bc Added --analyzevideo 2017-06-02 12:32:45 -07:00
AlexBratosin2001
a842e1f7db Fix PTS and Length in preview section (Teletext subtitles) 2017-06-02 19:12:31 +03:00
Carlos Fernandez
adacb6235e Merge branch 'master' of https://github.com/CCExtractor/ccextractor 2017-04-24 12:07:00 -07:00
Carlos Fernandez
78c410cf50 Added -nolevdist to disable automatic typo fixing in teletext 2017-04-24 12:06:33 -07:00
Carlos Fernandez Sanz
fdd5b6bf9d Merge pull request #725 from Amey-Jain/master
Issue 699 solved
2017-04-11 09:23:03 -07:00
Amey Jain
1513b7c42f Timing for sample #70 corrected. 2017-04-11 10:44:44 +05:30
Carlos Fernandez
76eea831ca When NAL decoding fails, don't dump the whole decoded thing, limit to 160 bytes. 2017-04-10 17:11:45 -07:00
Carlos Fernandez
1b6e05083d - TS: Skip NULL packets
- TS: If we don't have pinfo don't pay attention to the current_next_indicator bit.
(fixes problem with The Lion Guard_20170321_09301000.ts). Not sure this fix is the correct one but that's what VLC does.
2017-04-10 11:46:01 -07:00
Amey Jain
1b3598b2fe Timing mis-match corrected. 2017-04-08 08:34:18 +05:30
LucasYoung
dd5c1ee243 Added WebVTT output from Matroska 2017-03-30 00:09:40 -07:00
Amey Jain
094a8f295a Issue 699 solved. 2017-03-21 01:06:00 +05:30
Amey Jain
d2bea3802e In ref. to issue 699. 20 ms timing mis-match.
modified:   src/lib_ccx/general_loop.c
modified:   src/lib_ccx/telxcc.c
2017-03-20 14:03:41 +05:30
Barun Parruck
4f692138fc Add missing line terminator in webvtt
Fix #680
2017-03-18 19:43:30 +05:30
Diptanshu8
baa70a104a char pointer misconception for newline characters 2017-03-16 16:09:06 +05:30
Diptanshu8
7212d6848a Refactoring 2017-03-16 15:58:38 +05:30
Diptanshu8
b723378eb2 Quick fixes 2017-03-16 02:39:25 +05:30
Diptanshu8
5889d1edcd Quick fixes 2017-03-16 02:29:26 +05:30
Diptanshu8
c11ff21499 Done with mkvlang options basic checking 2017-03-16 01:45:31 +05:30
Diptanshu8
b08c6285e6 Checking of the mkvlang for last option is left and checking for '-' in the code is left 2017-03-15 23:37:43 +05:30
Diptanshu8
6aaaf6d9ae MKVlang support for a multi-language extraction has been added. 2017-03-15 16:57:52 +05:30
Diptanshu8
3b0031c251 MKVlang support for a single language has been added. 2017-03-15 16:47:26 +05:30
Diptanshu8
75a010fe77 Static option _eng_ for only english subtitles has been set 2017-03-15 15:14:06 +05:30
Diptanshu8
712e44e26c Checking the concatenated videos for subtitle errors and resolved 2017-03-15 14:36:43 +05:30
Diptanshu8
9bbab6b595 ASS/SSA newline character covered 2017-03-15 00:48:45 +05:30
Diptanshu8
f944c93faa yet to cover ass/ssa subtitles' newline 2017-03-15 00:35:40 +05:30
Diptanshu8
2c7bfef0f0 ASS/SSA support added for avoiding newline chars at the beginning of sentence. 2017-03-15 00:35:40 +05:30
Diptanshu8
6f5b2aa360 Changes made to avoid breakline/newline at the beginning of sentence. 2017-03-15 00:35:40 +05:30
Diptanshu8
d08cad3642 Rebasing 2017-03-15 00:35:17 +05:30
Diptanshu8
d2d055dd37 Done with removal of MATROSKA_MAX_TRACKS 2017-03-15 00:33:58 +05:30
Diptanshu8
7feb705d73 Rebasing 2017-03-15 00:33:12 +05:30
Saurabh Shrivastava
7b4c3bb26d Removed deprecated function. 2017-03-12 00:39:57 +05:30
Saurabh Shrivastava
4bcc79fdb7 Use already available functions.
Removed code with redundant functionality.
-extracting filename without extension
-generating timestamp for srt format from milliseconds
2017-03-12 00:28:33 +05:30
Carlos Fernandez
5f510cdfa2 filename_non_ext instead of filename_without_ext in matroska.c 2017-03-06 11:32:06 -08:00
Evgeny Shulgin
f057a7db05 Added multiplatform LLD and LLU specs 2017-03-06 16:48:54 +03:00
kapil kumar
d80a4f4b3c modified subtitle file name generated by matroska(mkv) -some changes 2017-03-05 23:57:57 +05:30
kapil kumar
fac11ec5ed modified subtitle file name generated by matroska(mkv) 2017-03-05 23:45:43 +05:30
Carlos Fernandez Sanz
66e8b280de Merge pull request #707 from Izaron/matroska
Replaced %lld with %I64d
2017-03-05 10:05:05 -08:00
Evgeny Shulgin
564093e0ec Removed builtin code in Matroska 2017-03-05 20:18:13 +03:00
Evgeny
af3fff4034 Replaced %lld with %I64d 2017-03-05 15:05:07 +03:00
Evgeny
a66f3c3973 Added "No captions" code support in Matroska 2017-03-02 16:46:10 +03:00
Evgeny
edaa3b828b Fixed bug with sub name in Windows 2017-03-02 16:38:47 +03:00
Evgeny
76cb7b91ee Added matroska.c to filters and fixed _MSC_VER 2017-03-02 16:18:03 +03:00
Evgeny Shulgin
2048827c45 Added time for the activity progress 2017-03-02 15:49:22 +03:00
Evgeny Shulgin
1f478cfb22 Added matroska warnings about "-out=" 2017-03-02 15:43:02 +03:00
Evgeny Shulgin
e74074ffd0 Removed matroska int and byte types 2017-03-02 14:44:43 +03:00
Evgeny Shulgin
20b557ff97 Matroska main part integrated 2017-03-01 21:50:20 +03:00
Evgeny Shulgin
28f84c768e Added Matroska decoder skeleton 2017-03-01 19:40:09 +03:00
Carlos Fernandez
1d281004e7 Merge branch 'pr/n700_saurabhshri' 2017-02-28 16:35:22 -08:00
Carlos Fernandez
f6f0f79954 Merge branch 'pr/n702_kapilkd13' 2017-02-28 16:34:32 -08:00
Carlos Fernandez
656bac1a6f Merge branch 'master' of https://github.com/CCExtractor/ccextractor 2017-02-28 16:33:56 -08:00
Carlos Fernandez
66344c51fb Merge branch 'pr/n703_Izaron' 2017-02-28 16:33:44 -08:00
kapil kumar
23947beecc typo error 2017-02-28 21:14:22 +05:30
kapil kumar
876a442362 added usage for Levenshtein. fixed issue 701 2017-02-28 20:31:09 +05:30
Saurabh Shrivastava
4ef4a9b03e Teletext from .bin now honours -unixts
Also, fixed no caption found even when it was found from .bin for
teletext.
2017-02-26 23:28:17 +05:30
Saurabh Shrivastava
b44fbefc97 Merge remote-tracking branch 'refs/remotes/CCExtractor/master' into ucla 2017-02-24 01:43:15 +05:30
AlexBratosin2001
cbffbb0358 Sync Teletext to Private Stream 1 instead of all streams 2017-02-23 16:33:07 +02:00
Evgeny Shulgin
52c7a8474f Added missing swscale library 2017-02-22 16:40:40 +03:00
Evgeny Shulgin
4f5f564b59 Automatically enable HARDSUBX when FFMPEG used 2017-02-22 16:36:40 +03:00
Evgeny Shulgin
00ead85ab7 Fixed FFMPEG libs and set all libs non-static 2017-02-21 19:31:12 +03:00
Evgeny Shulgin
4ee9c847da Fixed CMake for OCR 2017-02-20 19:31:58 +03:00
Evgeny Shulgin
29180a95b1 Rewritten output handler 2017-02-20 17:39:43 +03:00
Carlos Fernandez Sanz
1124d82687 Merge pull request #685 from sidgairo18/fix_defects
Fix defects
2017-02-15 09:29:25 -08:00
Barun Parruck
e919f1b9f5 Add comments, change array iteration 2017-02-15 01:49:58 +05:30
Barun Parruck
73f3c83940 Add -lf support | make line terminator consistent 2017-02-14 16:22:27 +05:30
Siddhartha Gairola
1425f426dc Update stbl_write.c 2017-02-14 12:28:17 +05:30
Siddhartha Gairola
3e37250d44 Update networking.c 2017-02-12 15:29:48 +05:30
sidgairo18
3be78775ca Fixing memory defects 2017-02-12 15:08:45 +05:30
Saurabh Shrivastava
ab1c7ab563 Fixed missing tpage number in UCLA from BIN. 2017-02-12 14:17:34 +05:30
Barun Parruck
7c2483d73e Fix webvtt formatting
The lack of CRLF after the header led to an invalid webvtt format.
2017-02-10 14:54:42 +05:30
Abhinav Shukla
9f32edad63 Merging change to Tesseract OEM mode with new hardsubx code 2017-02-09 02:40:01 +05:30
Abhinav Shukla
3278b31a8f Setting up tickertape parameter 2017-02-09 02:30:51 +05:30
Carlos Fernandez
cf762df972 Minor typo 2017-02-06 11:11:44 -08:00
Saurabh Shrivastava
d37086a434 Removed deprecated params and added missing one.
-o1 and -o2 were deprecated in commit
0541a2fb62

--version was missing.
2017-02-06 03:21:34 +05:30
Saurabh Shrivastava
5ce5dc7fae Fix for NULL output filename.
If no -o is supplied with stdin/network etc, the output name generated
was NULL, leading to creation of files like `.srt` which were in
category of hidden files.
2017-02-06 03:11:53 +05:30
Saurabh Shrivastava
851894dceb Fatal if unable to open output file. 2017-02-06 03:11:53 +05:30
Saurabh Shrivastava
6837a1070b Printing end message after fatal. 2017-02-06 02:11:53 +05:30
Carlos Fernandez
482a20430d ffmpeg_intgr.c: Wrong directory was being used in ffmpeg #include. 2017-01-30 15:54:05 -08:00
Carlos Fernandez
85751cee2b dvbsub_parse_clut_segment: Changed return on fail (from 0 to -1, as expected by the caller). 2017-01-30 15:14:16 -08:00
Carlos Fernandez
12467815ae Closes #675
Moved final text after dinit().
2017-01-30 13:06:48 -08:00
Carlos Fernandez
0d7b33c362 Merge branch 'pr/n673_AlexBratosin2001' 2017-01-30 12:58:25 -08:00
Carlos Fernandez
79a6d3d04a Merge branch 'master' of https://github.com/CCExtractor/ccextractor 2017-01-30 12:57:01 -08:00
Carlos Fernandez
6bbb649ee6 Fix crash on -xmltv -out=null due to not having an encoder context. 2017-01-30 12:56:44 -08:00
AlexBratosin2001
919d3ec3b0 Fixed timing error caused by last commit 2017-01-28 23:24:50 +02:00
AlexBratosin2001
c5ea59aeb1 Fixed huge memory leak related to OCR init 2017-01-28 22:20:08 +02:00
Willem
de958c9383 Delete .DS_Store files and add it to .gitignore file 2017-01-28 18:03:22 +01:00
Carlos Fernandez
51537e8725 networking, multicast: In linux, bind to the specific IP address of the multicast source.
OCR: init some variables that didn't have a default value.
2017-01-26 10:16:52 -08:00
Carlos Fernandez
ec9a0985ce Rework signals 2017-01-24 11:06:09 -08:00
Carlos Fernandez
71dffd6eb3 Use TessDeleteText to delete strings received from Tesseract 2017-01-23 16:05:05 -08:00
Carlos Fernandez Sanz
8dc1964f8c Merge pull request #660 from Izaron/708-adventures
Fixed ttxt 708 files segfault
2017-01-23 10:05:10 -08:00
Evgeny Shulgin
d72e946213 Fixed ttxt 708 files segfault 2017-01-23 19:29:28 +04:00
Saurabh Shrivastava
77da2dc873 Added -chapters parameter for chapter extraction from MP4. 2017-01-21 01:43:41 +05:30
Saurabh Shrivastava
d9414782b2 Function prototype for chapter extraction. 2017-01-21 01:38:12 +05:30
Saurabh Shrivastava
d1b127164e Option for chapter extraction. 2017-01-21 01:37:13 +05:30
Saurabh Shrivastava
db8d9c67b6 By default don't extract chapters. 2017-01-21 01:35:58 +05:30
Saurabh Shrivastava
c9a3a0c7f2 Extract chapters from mp4 file and write it in a file. 2017-01-21 01:33:59 +05:30
Saurabh Shrivastava
0e4d211eaf Extract chapter instead of subtitles if extract_chapters is True. 2017-01-21 01:29:04 +05:30
Carlos Fernandez
57daaf3e4d - Correct indenting in ccextractor.c
- Correct return code for multiprogram transport streams
2017-01-19 15:14:54 -08:00
Carlos Fernandez
bc1e309b13 Added "CCX_DTVCC_C0_NUL" (do nothing, but prevent the "unhandled" warning) 2017-01-19 14:53:15 -08:00
AlexBratosin2001
09778b2d14 Sped up min_pts calculation (avoided lots of unnecessary loop iterations) 2017-01-19 22:06:39 +02:00
Evgeny
89c00a7e21 Added OEM mode parameter 2017-01-19 20:57:35 +03:00
Carlos Fernandez
bb026a7318 Merge branch 'pr/n649_sidgairo18' 2017-01-17 11:53:13 -08:00
maxkoryukov
566d1284f2 Remove SBS stuff from decoder_init 2017-01-15 23:55:41 +05:00
maxkoryukov
b5b2a7d70d Probably fix maxkoryukov/ccextractor#1: split into sentences
This version returns reasonably readable subs, split into sentences
2017-01-15 22:37:51 +05:00
maxkoryukov
93e407f4a5 Improve SBS: fix for #639 and non-greedy similarity detection
* Use own SBS-context structure to store SBS-data (fix CCExtractor/ccextractor#639)
* Search for BEST match of new string and SBS-buffer (instead of first appropriate..)
* all tests are fixed and passed
2017-01-15 22:37:51 +05:00
maxkoryukov
ad7b141cc6 Tiny fixes 2017-01-15 22:37:51 +05:00
maxkoryukov
f23beab07e Fix error with uninitialized sbs_handled_len. Free sbs_buffer on dinit_encoder_context
* more debug for SBS
2017-01-15 22:37:51 +05:00
maxkoryukov
c582175d35 Wrap debug instructions in #ifdef 2017-01-15 22:37:51 +05:00
maxkoryukov
1b1a572f73 SBS: use Levenshtein distance to detect duplicates in subs
see maxkoryukov/ccextractor#1
2017-01-15 22:37:51 +05:00
maxkoryukov
7c9ffbbde9 Levenshtein for char * in utility.c
see maxkoryukov/ccextractor#1
2017-01-15 22:37:51 +05:00
maxkoryukov
5c2d6956fd Fixed format specifiers for debug output 2017-01-15 22:37:51 +05:00
Siddhartha Gairola
fad623ed6a Update general_loop.c 2017-01-15 14:08:16 +05:30
Siddhartha Gairola
263dd2cb40 Update ccx_encoders_webvtt.c 2017-01-15 14:07:04 +05:30
sidgairo18
2d56d067e3 Fixed issue #648 2017-01-15 14:02:49 +05:30
Evgeny
e048c65cdb [CEA-708] Added BS command 2017-01-14 19:30:03 +03:00
Evgeny
41cd5f00bc Updated 708 window dump 2017-01-14 18:17:31 +03:00
AlexBratosin2001
7ab968c4a6 Merge remote-tracking branch 'upstream/master' 2017-01-13 23:35:59 +02:00
AlexBratosin2001
462f63a294 Fixed DVB multiprogram. 2017-01-13 23:35:34 +02:00
Carlos Fernandez
f75793c5e4 Merge branch 'master' of https://github.com/CCExtractor/ccextractor 2017-01-13 11:32:22 -08:00
Carlos Fernandez
521ee29ab8 Corrections in dvbcolor help and code comments. 2017-01-13 11:32:11 -08:00
Carlos Fernandez Sanz
91690f9453 Merge pull request #643 from Izaron/708-stuff
CEA-708 subtitle counter
2017-01-13 09:55:12 -08:00
Evgeny
aa0db3c528 Using correct CEA-708 subs counter 2017-01-13 20:34:50 +03:00
Evgeny
a727d2df26 Fixed hot bug with CEA-708 counter 2017-01-13 20:32:29 +03:00
Carlos Fernandez Sanz
8f818051b0 Merge pull request #642 from Izaron/708-stuff
CEA-708 improvements
2017-01-13 06:54:12 -08:00
Evgeny
1fb98118c6 Added CEA-708 counter for EXIT_NO_CAPTIONS 2017-01-13 16:40:08 +03:00
Evgeny
967e2bc695 decoder->tv refactoring 2017-01-13 16:15:43 +03:00
Evgeny
0855c0a41d Added support of SAMI and TTXT in CEA-708 2017-01-13 16:04:37 +03:00
Evgeny
bff384e677 Refactored CEA-708 symbol struct 2017-01-13 15:46:41 +03:00
Evgeny
3b2545cf82 Fixed timing bug catching in CEA-708 2017-01-13 14:06:49 +03:00
Naman Yadav
ab31e7b4d4 Check for language tesseract data in /usr/share/tessdata/ (#638)
Closes https://github.com/CCExtractor/ccextractor/issues/448
2017-01-12 15:07:28 -08:00
gonzaloUran
cd17aa3a53 make -ignoreptsjumps and -dvbcolor default (#637)
* default-arguments

* default-arguments
2017-01-12 14:44:56 -08:00
Carlos Fernandez
d99fda59a3 Merge branch 'pr/n640_Izaron' 2017-01-12 10:33:19 -08:00
Evgeny
ddbd03760b Added max macro for non-Visual Studio IDE 2017-01-12 21:27:36 +03:00
Evgeny
7078f10150 Fixed 708 pen handling from line by line to correct 2017-01-12 21:07:16 +03:00
Siddhartha Gairola
6c733e96c9 Fix defects (#630)
* Fixed memory leaks issue #615

* Fixed memory leaks issue #615

* Update lib_ccx.c

* Fixing issue #629

* Update networking.c
2017-01-11 13:06:56 -08:00
Saurabh Shrivastava
720008f9fb Update params.c 2017-01-12 02:03:08 +05:30
AlexBratosin2001
ade11eb80f Fixed DVB multiprogram (timing is still broken) 2017-01-11 22:30:39 +02:00
Saurabh Shrivastava
d8a6642d5f Exit if input source is stdin for MP4. 2017-01-12 01:58:25 +05:30
Saurabh Shrivastava
2464064226 Stop GPAC from analyzing if input source is stdin. 2017-01-12 01:54:31 +05:30
Carlos Fernandez
591d74d0c5 Merge branch 'master' of https://github.com/CCExtractor/ccextractor 2017-01-11 11:53:56 -08:00
Carlos Fernandez
14286a0025 Avoid calling fatal() on warning.
Minor indenting.
2017-01-11 11:53:43 -08:00
Evgeny
ddce5829d5 Musical note should replace 'Delete' in G0 Table 2017-01-11 19:18:32 +03:00
Evgeny
ebd9fc4bfe Minor mistake in bitstream fixed 2017-01-11 14:14:35 +03:00
gonzaloUran
6f8d99b39e pesheader-option (#628) 2017-01-10 16:56:30 -08:00
saurabhkapur
65634a18d1 Fixes #618 (#619) 2017-01-10 16:27:28 -08:00
Saurabh Shrivastava
827ace8dca SMPTE-TT : Removed appearance of garbage value in color code. (#625)
Also polished and improved existing code. More details : CCExtractor#620
2017-01-10 16:26:23 -08:00
AlexBratosin2001
baa5b0d14f Fixed error regarding last commit 2017-01-11 00:24:07 +02:00
AlexBratosin2001
b633491b91 Perfected DVB timing and cleaned up code 2017-01-11 00:00:06 +02:00
AlexBratosin2001
9f331b6a92 Merge remote-tracking branch 'upstream/master' 2017-01-10 23:59:16 +02:00
Carlos Fernandez
b5de22ff13 Solve crash caused by boxdestroy? 2017-01-10 13:06:34 -08:00
Carlos Fernandez
e4c9a95f7c Memory leaks in ocr.c 2017-01-10 12:59:23 -08:00
Carlos Fernandez
90001b6c23 Fix in memory free 2017-01-10 12:20:15 -08:00
Carlos Fernandez
19fec61902 Fix: Memory leak in ccx_encoders_smptett.c 2017-01-10 12:16:05 -08:00
Carlos Fernandez
acc63cc478 Merge branch 'pr/n623_Izaron' 2017-01-10 11:35:24 -08:00
Carlos Fernandez
4017b59f97 Init saw_caption_block in decoder context. 2017-01-10 11:18:18 -08:00
Evgeny
407a40e32e Fixed italics and underline bit flags 2017-01-10 22:17:13 +06:00
Evgeny
714700f6b5 Fixed column count to correct values 2017-01-10 22:03:50 +06:00
Evgeny
d60baf1895 Added support of UTF16 2017-01-10 12:12:13 +06:00
Carlos Fernandez
d7d7d62971 Updated CHANGES.TXT with the new stuff for 0.85. 2017-01-09 11:06:19 -08:00
AlexBratosin2001
737c0f4205 Added Alexandru Bratosin to README.TXT 2017-01-09 20:15:36 +02:00
Carlos Fernandez
766145275a Merge branch 'pr/n622_AlexBratosin2001'
Version change from 0.84 to 0.85.
2017-01-09 10:09:13 -08:00
Carlos Fernandez
7517a5448e Merge branch 'master' of https://github.com/CCExtractor/ccextractor 2017-01-09 10:04:27 -08:00
Carlos Fernandez
80a0f1a1c1 Changed where to report bugs. 2017-01-09 10:04:10 -08:00
Abhinav Shukla
5933194570 Progress activity and more context 2017-01-09 21:14:52 +05:30
AlexBratosin2001
898ce5bf87 Fixed Teletext related issues (+DVB) and added other stuff 2017-01-09 16:14:05 +02:00
Abhinav Shukla
fbc7cb5452 Setting up skeleton for tickertext based burned in detection 2017-01-09 17:51:15 +05:30