Marcin Cieślak
5137ebac0b
[tvp] Telewizja Polska: new extractor for tvp.pl, fixes #1719
...
Thanks-To: mplonski
https://github.com/mplonski/linux/blob/master/tvp-dl.py
2013-11-05 23:47:40 +01:00
Marcin Cieślak
a8eeb0597b
Fix AssertionError when og property not found
...
On tvp.pl some webpages contain OpenGraph
metadata and some don't.
If og property is not found, _og_search_description
fails with
WARNING: unable to extract OpenGraph description; please report this issue on http://yt-dl.org/bug
Traceback (most recent call last):
File "/usr/home/saper/bin/youtube-dl", line 18, in <module>
youtube_dl.main()
File "/usr/home/saper/sw/youtube-dl/youtube_dl/__init__.py", line 766, in main
_real_main(argv)
File "/usr/home/saper/sw/youtube-dl/youtube_dl/__init__.py", line 719, in _real_main
retcode = ydl.download(all_urls)
File "/usr/home/saper/sw/youtube-dl/youtube_dl/YoutubeDL.py", line 715, in download
videos = self.extract_info(url)
File "/usr/home/saper/sw/youtube-dl/youtube_dl/YoutubeDL.py", line 348, in extract_info
ie_result = ie.extract(url)
File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/common.py", line 125, in extract
return self._real_extract(url)
File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/tvp.py", line 56, in _real_extract
info['description'] = self._og_search_description(webpage)
File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/common.py", line 331, in _og_search_description
return self._og_search_property('description', html, fatal=False, **kargs)
File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/common.py", line 325, in _og_search_property
return unescapeHTML(escaped)
File "/usr/home/saper/sw/youtube-dl/youtube_dl/utils.py", line 494, in unescapeHTML
assert type(s) == type(u'')
AssertionError
The patch allows me to use:
try:
info['description'] = self._og_search_description(webpage)
info['thumbnail'] = self._og_search_thumbnail(webpage)
except RegexNotFoundError:
pass
2013-11-05 23:19:29 +01:00
Ismaël Mejía
4ed3e51080
[ted] fixed error in case of no subtitles present
...
I created a test, but I leave it commented since TED videos get
new subtitles frequently.
2013-11-05 12:00:13 +01:00
rzhxeo
2dcf7d8f99
[GenericIE] Also detect youtube if src url of iframe is embedded in ' instaed of "
2013-11-05 02:08:02 +01:00
Jaime Marquínez Ferrándiz
19b0668251
[canal2c] Accept more urls ( fixes #1723 )
...
The url only needs to have the 'idVideo' field in the query, in any position.
We have to set the 'void=oui' in the webpage url, so that we get the file name.
2013-11-04 22:26:19 +01:00
Jaime Marquínez Ferrándiz
e7e6b54d8a
[teamcoco] Parse the xml file and extract all the formats
2013-11-03 17:48:12 +01:00
Jaime Marquínez Ferrándiz
2a1a8ffe41
Merge pull request #1693 from alexvh/teamcoco_fix
...
[teamcoco] Fix video url extraction for some videos
2013-11-03 17:19:51 +01:00
Philipp Hagemeister
08fb86c49b
[youtube] Add description for YoutubeSearchDateIE ( #1710 )
2013-11-03 15:59:10 +01:00
Philipp Hagemeister
3633d77c0f
Merge remote-tracking branch 'CBGoodBuddy/ytsearchtime'
2013-11-03 15:56:55 +01:00
Philipp Hagemeister
165e179764
release 2013.11.03
2013-11-03 15:50:36 +01:00
Philipp Hagemeister
12ebdd1506
[viddler] Support non-digit IDs ( Fixes #1714 )
2013-11-03 15:49:59 +01:00
Jaime Marquínez Ferrándiz
1baf9a5938
Merge pull request #1698 from rzhxeo/cinemassacre
...
[CinemassacreIE] Support more embed urls
2013-11-03 05:17:12 -08:00
Jaime Marquínez Ferrándiz
a56f9de156
Style fixes for extractors: remove spaces around (,),{ and }
2013-11-03 14:06:47 +01:00
Jaime Marquínez Ferrándiz
fa5d47af4b
Merge pull request #1679 from rzhxeo/mofosex
...
Add support for http://www.mofosex.com
2013-11-03 05:04:14 -08:00
Jaime Marquínez Ferrándiz
d607038753
Merge pull request #1677 from rzhxeo/xtube
...
Add support for http://www.xtube.com
2013-11-03 03:28:02 -08:00
Jaime Marquínez Ferrándiz
9ac6a01aaf
Merge pull request #1676 from rzhxeo/extremetube
...
Add support for http://www.extremetube.com
2013-11-03 03:25:46 -08:00
Jaime Marquínez Ferrándiz
be97abc247
Set the 'extractor_key' field in the info_dict
...
It's the string returned by the class method 'ie_key', which allows to retrieve the extractor with 'get_info_extractor'
2013-11-03 12:14:44 +01:00
Jaime Marquínez Ferrándiz
9103bbc5cd
Add the 'webpage_url' field to info_dict
...
The url for the video page, it must allow to reproduce the result.
It's automatically set by YoutubeDL if it's missing.
2013-11-03 12:11:13 +01:00
Jaime Marquínez Ferrándiz
b6c45014ae
Set the extra_info inside YoutubeDL.process_ie_result and set only if the keys are missing
2013-11-03 11:57:04 +01:00
Craig Markwardt
a3dd924871
Add YoutubeSearchDateIE extractor to youtube.py & __init__.py, which searches by publication date.
2013-11-02 22:40:48 -04:00
rzhxeo
137bbb3e37
[XTubeIE] Add description to TEST
2013-11-02 22:45:48 +01:00
rzhxeo
86ad94bb2e
[ExtremeTubeIE] Set age_limit to 18 and fix uploader extraction
2013-11-02 22:33:49 +01:00
Jaime Marquínez Ferrándiz
3e56add7c9
Merge pull request #1678 from rzhxeo/keezmovies
...
[KeezMoviesIE] Detect URLs with numbers in the SEO part correct
2013-11-02 14:15:52 -07:00
Jaime Marquínez Ferrándiz
f52f01b5d2
[brightcove] Don't set the extension
...
If the video only has the 'FLVFullLengthURL' key, it can still be an mp4 file.
2013-11-02 21:20:46 +01:00
Jaime Marquínez Ferrándiz
98d7efb537
[exfm] skip tests
...
The site is down too often.
2013-11-02 20:51:09 +01:00
Jaime Marquínez Ferrándiz
cf51923545
[youtube] Remove vevo test
...
The video is no longer available and it seems that vevo video don't use encrypted signatures anymore.
2013-11-02 20:46:26 +01:00
Ismaël Mejía
38fcd4597a
Merge remote-tracking branch 'iemejia/master'
2013-11-02 19:56:06 +01:00
Jaime Marquínez Ferrándiz
165e3bb67a
[bambuser] Add an extractor for channels ( closes #1702 )
2013-11-02 19:50:57 +01:00
Ismaël Mejía
38db46794f
Merge branch 'ted_subtitles'
2013-11-02 19:50:45 +01:00
Ismaël Mejía
a9a3876d55
[ted] Added support for subtitle download
2013-11-02 19:48:39 +01:00
Ismaël Mejía
1f343eaabb
[subtitles] refactor to support websites with subtitle information the
...
webpage.
I added the parameter webpage, so now it's similar to the way automatic
captions are handled. This is an improvement needed for websites like
TED.
2013-11-02 19:29:25 +01:00
Jaime Marquínez Ferrándiz
72a5b4f702
Add an extractor for bambuser.com ( #1702 )
2013-11-02 19:01:01 +01:00
rzhxeo
0a43ddf320
[CinemassacreIE] Add live paramter to extracted info as a workaround
2013-11-02 18:08:35 +01:00
rzhxeo
31366066bd
Add support for live parameter to rtmpdump
2013-11-02 18:08:16 +01:00
Philipp Hagemeister
aa2484e390
release 2013.11.02
2013-11-02 11:21:36 +01:00
Philipp Hagemeister
8eddf3e91d
[youtube] Encode subtitle track name in request ( Fixes #1700 )
2013-11-02 11:21:05 +01:00
Jaime Marquínez Ferrándiz
60d142aa8d
Add an extractor for vk.com ( closes #1635 )
2013-11-01 22:34:18 +01:00
Jaime Marquínez Ferrándiz
66cf3ac342
[metacafe] Fix support for age-restricted videos ( fixes #1696 )
...
The 'Content-Type' header must be set for disabling the family filter.
The 'flashversion' cookie is only needed for AnyClip videos.
Added tests for standard metacafe videos and for age-restricted videos.
Also set the 'age_limit' field.
2013-11-01 11:56:15 +01:00
rzhxeo
ab4e151347
[CinemassacreIE] Support more embed urls
2013-11-01 01:24:23 +01:00
Alex Van't Hof
ac2547f5ff
[teamcoco] Fix video url extraction for some videos
...
Video url extraction failed for some videos,
e.g. http://teamcoco.com/video/old-time-baseball
The url extracted was also occasionally suboptimal quality,
e.g. http://teamcoco.com/video/louis-ck-interview-george-w-bush
2013-10-31 15:41:14 -04:00
Jaime Marquínez Ferrándiz
5f1ea943ab
[livestream] fix the extraction of events
...
It now uses a json dictionary from the webpage.
2013-10-31 08:07:26 +01:00
Philipp Hagemeister
9f1109a564
[dailymotion] Fix support for age-restricted videos ( Fixes #1688 )
2013-10-31 00:20:49 +01:00
Philipp Hagemeister
33b1d9595d
release 2013.10.30
2013-10-30 01:17:20 +01:00
Philipp Hagemeister
7193498811
Use index in formt string (Fixes vevo test on Python 2.6)
2013-10-30 01:17:00 +01:00
Philipp Hagemeister
72321ead7b
[vevo] Readd support for SMIL ( Fixes #1683 )
2013-10-30 01:14:17 +01:00
Philipp Hagemeister
b5d0d817bc
Remove superfluous space
2013-10-30 01:09:44 +01:00
Philipp Hagemeister
94badb2599
Fix output indenting for --list-formats
2013-10-30 01:09:26 +01:00
Filippo Valsorda
b9a836515f
Update the Vimeo test vector md5
...
confirmed that this is indeed the first 10241 (we went off by one with
byte range 0-10240) of the full, playing mp4, so they probably
reencoded or something
2013-10-29 16:44:35 -04:00
Jaime Marquínez Ferrándiz
21c924f406
[arte] Download the 'Originalversion' version if it's the only one available ( fixes #1682 )
2013-10-29 20:58:49 +01:00
Philipp Hagemeister
e54fd4b23b
[vevo] Add more format details
2013-10-29 15:10:09 +01:00
Philipp Hagemeister
57dd9a8f2f
Nicer --list-formats output
2013-10-29 15:09:45 +01:00
Philipp Hagemeister
912cbf5d4e
[vevo] Fix timestamp handling
...
( / 1000 is implicit float division )
2013-10-29 14:00:23 +01:00
Philipp Hagemeister
43d7895ea0
release 2013.10.29
2013-10-29 06:48:39 +01:00
Philipp Hagemeister
f7ff55aa78
Merge remote-tracking branch 'origin/master'
2013-10-29 06:48:18 +01:00
Philipp Hagemeister
795f28f871
[youtube] Fix login ( Fixes #1681 )
2013-10-29 06:45:54 +01:00
Jaime Marquínez Ferrándiz
321a01f971
[mtv] Remove the templates from the mediagen url
2013-10-28 23:37:01 +01:00
Jaime Marquínez Ferrándiz
2563bcc85c
Add an extractor for MySpace ( closes #1666 )
2013-10-28 22:02:17 +01:00
Jaime Marquínez Ferrándiz
702665c085
tests: build the filename from the info_dict if the 'file' key is missing
...
It will need to have the 'id' and 'ext' keys to work.
2013-10-28 22:01:37 +01:00
rzhxeo
dcc2a706ef
Add support for http://www.xtube.com
2013-10-28 19:23:48 +01:00
rzhxeo
2bc67c35ac
[KeezMoviesIE] Detect URLs with numbers in the SEO part correct
2013-10-28 18:22:55 +01:00
rzhxeo
77ae65877e
Add support for http://www.mofosex.com
2013-10-28 18:18:58 +01:00
rzhxeo
32a35e4418
Add support for http://www.extremetube.com
2013-10-28 17:35:01 +01:00
rzhxeo
216d71d001
Check if description and thumbnail are None to prevent crash
2013-10-28 16:28:35 +01:00
Philipp Hagemeister
78a3a9f89e
Make "requested format not available" expected ( #1655 )
2013-10-28 11:41:59 +01:00
Philipp Hagemeister
a7685f3bf4
mixcloud does not do any format selection
2013-10-28 11:41:32 +01:00
Philipp Hagemeister
f088ea5486
release 2013.10.28
2013-10-28 11:34:21 +01:00
Philipp Hagemeister
1003d108d5
[vimeo] Support hash in URL ( Fixes #1669 )
2013-10-28 11:32:22 +01:00
Philipp Hagemeister
8abeeb9449
Nicer --list-formats output
2013-10-28 11:31:12 +01:00
Philipp Hagemeister
c1002e96e9
Let extractors omit ext in formats
2013-10-28 11:28:02 +01:00
Philipp Hagemeister
77d0a82fef
[addanime] Use new formats system
2013-10-28 11:24:47 +01:00
Philipp Hagemeister
ebc14f251c
Merge remote-tracking branch 'origin/master'
2013-10-28 10:44:13 +01:00
Philipp Hagemeister
d41e6efc85
New debug option --write-pages
2013-10-28 10:44:02 +01:00
Filippo Valsorda
8ffa13e03e
[Instagram] get the non-https link, as they are serving Akamai cert from a instagram.com domain
2013-10-28 02:34:29 -04:00
Filippo Valsorda
750e9833b8
Add the missing age_limit tags; added a devscript to do a superficial check for porn sites without the age_limit tag in the test
2013-10-28 01:50:17 -04:00
Filippo Valsorda
82f0ac657c
Merge pull request #1657 by @rzhxeo
...
[YouPornIE] Extract all encrypted links and remove doubles at the end
2013-10-28 01:45:52 -04:00
Filippo Valsorda
eb6a2277a2
Merge pull request #1659 by @rzhxeo
...
Add support for http://www.tube8.com
2013-10-28 01:38:28 -04:00
Filippo Valsorda
f8778fb0fa
Merge pull request #1663 by @rzhxeo
...
Add support for http://www.spankwire.com
2013-10-28 01:35:11 -04:00
Filippo Valsorda
e2f9de207c
Merge pull request #1664 by @rzhxeo
...
Add support for http://www.keezmovies.com
2013-10-28 01:25:46 -04:00
Filippo Valsorda
a93cc0d943
Merge pull request #1661 by @rzhxeo
...
Add support for http://www.pornhub.com
2013-10-28 00:50:39 -04:00
Filippo Valsorda
7d8c2e07f2
[Exfm] replace the failing Soundcloud test vector (broken also in browser)
2013-10-28 00:33:43 -04:00
Filippo Valsorda
efb4c36b18
Merge pull request #1660 from pyed/master
...
[addanime] try to download HQ before normal
2013-10-27 21:14:19 -07:00
Jaime Marquínez Ferrándiz
29526d0d2b
Merge pull request #1656 from rzhxeo/xhamster
...
[XHamsterIE] Extract SD and HD video
2013-10-27 10:12:59 -07:00
Abdulelah Alfntokh
198e370f23
[addanime] better regex.
2013-10-27 19:48:02 +03:00
Jaime Marquínez Ferrándiz
c19f7764a5
[generic] Detect bandcamp pages that use custom domains ( closes #1662 )
...
They embed the original url in the 'og:url' property.
2013-10-27 14:40:25 +01:00
Jaime Marquínez Ferrándiz
bc63d9d329
[rtlnow] Change the test for rtlnitronow
2013-10-27 14:26:19 +01:00
Jaime Marquínez Ferrándiz
aa929c37d5
[generic] Fix test video's checksum
2013-10-27 14:21:37 +01:00
Jaime Marquínez Ferrándiz
af4d506eb3
[faz] Use a regex for getting the description
...
The page cannot be parsed in python2.6 with the html parser.
2013-10-27 14:18:55 +01:00
rzhxeo
5da0549581
[KeezMoviesIE] Correct return value for embedded videos
2013-10-27 12:48:09 +01:00
Jaime Marquínez Ferrándiz
749a4fd2fd
[facebook] Don't recommend to report the issue if the video is private.
2013-10-27 12:13:55 +01:00
Jaime Marquínez Ferrándiz
6f71ef580c
[facebook] Report a more meaningful message if the video cannot be accessed ( closes #1658 )
2013-10-27 12:09:46 +01:00
Jaime Marquínez Ferrándiz
67874aeffa
[facebook] Fix the login process ( fixes #1244 )
2013-10-27 12:07:58 +01:00
Abdulelah Alfntokh
3e6a330d38
[addanime] fix md5sum
2013-10-27 13:51:26 +03:00
Abdulelah Alfntokh
aee5e18c8f
[addanime] catch 'RegexNotFoundError'
2013-10-27 13:36:43 +03:00
rzhxeo
5b11143d05
Add support for http://www.keezmovies.com
2013-10-27 10:10:28 +01:00
rzhxeo
7b2212e954
Add support for http://www.spankwire.com
2013-10-27 01:59:26 +02:00
rzhxeo
71865091ab
[Tube8IE] Fix regex for uploader extraction
2013-10-27 01:08:03 +02:00
rzhxeo
125cfd78e8
Add support for http://www.pornhub.com
2013-10-27 01:04:22 +02:00
rzhxeo
8cb57d9b91
[Tube8IE] Escape dot in regex
2013-10-27 00:21:27 +02:00
pyed
14e10b2b6e
[addanime] try to download HQ before normal
2013-10-27 01:19:38 +03:00
rzhxeo
6e76104d66
[YouPornIE] Make webpage download more robust
2013-10-26 23:33:32 +02:00