mirror of
https://github.com/l1ving/youtube-dl
synced 2025-03-13 05:47:26 +08:00
commit
a44b30c784
6
.github/ISSUE_TEMPLATE.md
vendored
6
.github/ISSUE_TEMPLATE.md
vendored
@ -6,8 +6,8 @@
|
|||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
### Make sure you are using the *latest* version: run `youtube-dl --version` and ensure your version is *2017.10.01*. If it's not, read [this FAQ entry](https://github.com/rg3/youtube-dl/blob/master/README.md#how-do-i-update-youtube-dl) and update. Issues with outdated version will be rejected.
|
### Make sure you are using the *latest* version: run `youtube-dl --version` and ensure your version is *2017.10.20*. If it's not, read [this FAQ entry](https://github.com/rg3/youtube-dl/blob/master/README.md#how-do-i-update-youtube-dl) and update. Issues with outdated version will be rejected.
|
||||||
- [ ] I've **verified** and **I assure** that I'm running youtube-dl **2017.10.01**
|
- [ ] I've **verified** and **I assure** that I'm running youtube-dl **2017.10.20**
|
||||||
|
|
||||||
### Before submitting an *issue* make sure you have:
|
### Before submitting an *issue* make sure you have:
|
||||||
- [ ] At least skimmed through the [README](https://github.com/rg3/youtube-dl/blob/master/README.md), **most notably** the [FAQ](https://github.com/rg3/youtube-dl#faq) and [BUGS](https://github.com/rg3/youtube-dl#bugs) sections
|
- [ ] At least skimmed through the [README](https://github.com/rg3/youtube-dl/blob/master/README.md), **most notably** the [FAQ](https://github.com/rg3/youtube-dl#faq) and [BUGS](https://github.com/rg3/youtube-dl#bugs) sections
|
||||||
@ -35,7 +35,7 @@ Add the `-v` flag to **your command line** you run youtube-dl with (`youtube-dl
|
|||||||
[debug] User config: []
|
[debug] User config: []
|
||||||
[debug] Command-line args: [u'-v', u'http://www.youtube.com/watch?v=BaW_jenozKcj']
|
[debug] Command-line args: [u'-v', u'http://www.youtube.com/watch?v=BaW_jenozKcj']
|
||||||
[debug] Encodings: locale cp1251, fs mbcs, out cp866, pref cp1251
|
[debug] Encodings: locale cp1251, fs mbcs, out cp866, pref cp1251
|
||||||
[debug] youtube-dl version 2017.10.01
|
[debug] youtube-dl version 2017.10.20
|
||||||
[debug] Python version 2.7.11 - Windows-2003Server-5.2.3790-SP2
|
[debug] Python version 2.7.11 - Windows-2003Server-5.2.3790-SP2
|
||||||
[debug] exe versions: ffmpeg N-75573-g1d0487f, ffprobe N-75573-g1d0487f, rtmpdump 2.4
|
[debug] exe versions: ffmpeg N-75573-g1d0487f, ffprobe N-75573-g1d0487f, rtmpdump 2.4
|
||||||
[debug] Proxy map: {}
|
[debug] Proxy map: {}
|
||||||
|
1
.github/PULL_REQUEST_TEMPLATE.md
vendored
1
.github/PULL_REQUEST_TEMPLATE.md
vendored
@ -9,6 +9,7 @@
|
|||||||
### Before submitting a *pull request* make sure you have:
|
### Before submitting a *pull request* make sure you have:
|
||||||
- [ ] At least skimmed through [adding new extractor tutorial](https://github.com/rg3/youtube-dl#adding-support-for-a-new-site) and [youtube-dl coding conventions](https://github.com/rg3/youtube-dl#youtube-dl-coding-conventions) sections
|
- [ ] At least skimmed through [adding new extractor tutorial](https://github.com/rg3/youtube-dl#adding-support-for-a-new-site) and [youtube-dl coding conventions](https://github.com/rg3/youtube-dl#youtube-dl-coding-conventions) sections
|
||||||
- [ ] [Searched](https://github.com/rg3/youtube-dl/search?q=is%3Apr&type=Issues) the bugtracker for similar pull requests
|
- [ ] [Searched](https://github.com/rg3/youtube-dl/search?q=is%3Apr&type=Issues) the bugtracker for similar pull requests
|
||||||
|
- [ ] Checked the code with [flake8](https://pypi.python.org/pypi/flake8)
|
||||||
|
|
||||||
### In order to be accepted and merged into youtube-dl each piece of code must be in public domain or released under [Unlicense](http://unlicense.org/). Check one of the following options:
|
### In order to be accepted and merged into youtube-dl each piece of code must be in public domain or released under [Unlicense](http://unlicense.org/). Check one of the following options:
|
||||||
- [ ] I am the original author of this code and I am willing to release it under [Unlicense](http://unlicense.org/)
|
- [ ] I am the original author of this code and I am willing to release it under [Unlicense](http://unlicense.org/)
|
||||||
|
@ -11,12 +11,12 @@ sudo: false
|
|||||||
env:
|
env:
|
||||||
- YTDL_TEST_SET=core
|
- YTDL_TEST_SET=core
|
||||||
- YTDL_TEST_SET=download
|
- YTDL_TEST_SET=download
|
||||||
|
matrix:
|
||||||
|
fast_finish: true
|
||||||
|
allow_failures:
|
||||||
|
- env: YTDL_TEST_SET=download
|
||||||
script: ./devscripts/run_tests.sh
|
script: ./devscripts/run_tests.sh
|
||||||
notifications:
|
notifications:
|
||||||
email:
|
email:
|
||||||
- filippo.valsorda@gmail.com
|
- filippo.valsorda@gmail.com
|
||||||
- yasoob.khld@gmail.com
|
- yasoob.khld@gmail.com
|
||||||
# irc:
|
|
||||||
# channels:
|
|
||||||
# - "irc.freenode.org#youtube-dl"
|
|
||||||
# skip_join: true
|
|
||||||
|
104
ChangeLog
104
ChangeLog
@ -1,3 +1,105 @@
|
|||||||
|
version 2017.10.20
|
||||||
|
|
||||||
|
Core
|
||||||
|
* [downloader/fragment] Report warning instead of error on inconsistent
|
||||||
|
download state
|
||||||
|
* [downloader/hls] Fix total fragments count when ad fragments exist
|
||||||
|
|
||||||
|
Extractors
|
||||||
|
* [parliamentliveuk] Fix extraction (#14524)
|
||||||
|
* [soundcloud] Update client id (#14546)
|
||||||
|
+ [servus] Add support for servus.com (#14362)
|
||||||
|
+ [unity] Add support for unity3d.com (#14528)
|
||||||
|
* [youtube] Replace youtube redirect URLs in description (#14517)
|
||||||
|
* [pbs] Restrict direct video URL regular expression (#14519)
|
||||||
|
* [drtv] Respect preference for direct HTTP formats (#14509)
|
||||||
|
+ [eporner] Add support for embed URLs (#14507)
|
||||||
|
* [arte] Capture and output error message
|
||||||
|
* [niconico] Improve uploader metadata extraction robustness (#14135)
|
||||||
|
|
||||||
|
|
||||||
|
version 2017.10.15.1
|
||||||
|
|
||||||
|
Core
|
||||||
|
* [downloader/hls] Ignore anvato ad fragments (#14496)
|
||||||
|
* [downloader/fragment] Output ad fragment count
|
||||||
|
|
||||||
|
Extractors
|
||||||
|
* [scrippsnetworks:watch] Bypass geo restriction
|
||||||
|
+ [anvato] Add ability to bypass geo restriction
|
||||||
|
* [redditr] Fix extraction for URLs with query (#14495)
|
||||||
|
|
||||||
|
|
||||||
|
version 2017.10.15
|
||||||
|
|
||||||
|
Core
|
||||||
|
+ [common] Add support for jwplayer youtube embeds
|
||||||
|
|
||||||
|
Extractors
|
||||||
|
* [scrippsnetworks:watch] Fix extraction (#14389)
|
||||||
|
* [anvato] Process master m3u8 manifests
|
||||||
|
* [youtube] Fix relative URLs in description
|
||||||
|
* [spike] Bypass geo restriction
|
||||||
|
+ [howstuffworks] Add support for more domains
|
||||||
|
* [infoq] Fix http format downloading
|
||||||
|
+ [rtlnl] Add support for another type of embeds
|
||||||
|
+ [onionstudios] Add support for bulbs-video embeds
|
||||||
|
* [udn] Fix extraction
|
||||||
|
* [shahid] Fix extraction (#14448)
|
||||||
|
* [kaltura] Ignore Widevine encrypted video (.wvm) (#14471)
|
||||||
|
* [vh1] Fix extraction (#9613)
|
||||||
|
|
||||||
|
|
||||||
|
version 2017.10.12
|
||||||
|
|
||||||
|
Core
|
||||||
|
* [YoutubeDL] Improve _default_format_spec (#14461)
|
||||||
|
|
||||||
|
Extractors
|
||||||
|
* [steam] Fix extraction (#14067)
|
||||||
|
+ [funk] Add support for funk.net (#14464)
|
||||||
|
+ [nexx] Add support for shortcuts and relax domain id extraction
|
||||||
|
+ [voxmedia] Add support for recode.net (#14173)
|
||||||
|
+ [once] Add support for vmap URLs
|
||||||
|
+ [generic] Add support for channel9 embeds (#14469)
|
||||||
|
* [tva] Fix extraction (#14328)
|
||||||
|
+ [tubitv] Add support for new URL format (#14460)
|
||||||
|
- [afreecatv:global] Remove extractor
|
||||||
|
- [youtube:shared] Removed extractor (#14420)
|
||||||
|
+ [slideslive] Add support for slideslive.com (#2680)
|
||||||
|
+ [facebook] Support thumbnails (#14416)
|
||||||
|
* [vvvvid] Fix episode number extraction (#14456)
|
||||||
|
* [hrti:playlist] Relax URL regular expression
|
||||||
|
* [wdr] Relax media link regular expression (#14447)
|
||||||
|
* [hrti] Relax URL regular expression (#14443)
|
||||||
|
* [fox] Delegate extraction to uplynk:preplay (#14147)
|
||||||
|
+ [youtube] Add support for hooktube.com (#14437)
|
||||||
|
|
||||||
|
|
||||||
|
version 2017.10.07
|
||||||
|
|
||||||
|
Core
|
||||||
|
* [YoutubeDL] Ignore duplicates in --playlist-items
|
||||||
|
* [YoutubeDL] Fix out of range --playlist-items for iterable playlists and
|
||||||
|
reduce code duplication (#14425)
|
||||||
|
+ [utils] Use cache in OnDemandPagedList by default
|
||||||
|
* [postprocessor/ffmpeg] Convert to opus using libopus (#14381)
|
||||||
|
|
||||||
|
Extractors
|
||||||
|
* [reddit] Sort formats (#14430)
|
||||||
|
* [lnkgo] Relax URL regular expression (#14423)
|
||||||
|
* [pornflip] Extend URL regular expression (#14405, #14406)
|
||||||
|
+ [xtube] Add support for embed URLs (#14417)
|
||||||
|
+ [xvideos] Add support for embed URLs and improve extraction (#14409)
|
||||||
|
* [beeg] Fix extraction (#14403)
|
||||||
|
* [tvn24] Relax URL regular expression (#14395)
|
||||||
|
* [nbc] Fix extraction (#13651, #13715, #14137, #14198, #14312, #14314, #14378,
|
||||||
|
#14392, #14414, #14419, #14431)
|
||||||
|
+ [ketnet] Add support for videos without direct sources (#14377)
|
||||||
|
* [canvas] Generalize mediazone.vrt.be extractor and rework canvas and een
|
||||||
|
+ [afreecatv] Add support for adult videos (#14376)
|
||||||
|
|
||||||
|
|
||||||
version 2017.10.01
|
version 2017.10.01
|
||||||
|
|
||||||
Core
|
Core
|
||||||
@ -752,7 +854,7 @@ version 2017.04.14
|
|||||||
|
|
||||||
Core
|
Core
|
||||||
+ [downloader/hls] Add basic support for EXT-X-BYTERANGE tag (#10955)
|
+ [downloader/hls] Add basic support for EXT-X-BYTERANGE tag (#10955)
|
||||||
+ [adobepass] Improve Comcast and Verison login code (#10803)
|
+ [adobepass] Improve Comcast and Verizon login code (#10803)
|
||||||
+ [adobepass] Add support for Verizon (#10803)
|
+ [adobepass] Add support for Verizon (#10803)
|
||||||
|
|
||||||
Extractors
|
Extractors
|
||||||
|
@ -1,3 +1,5 @@
|
|||||||
|
[](https://travis-ci.org/rg3/youtube-dl)
|
||||||
|
|
||||||
youtube-dl - download videos from youtube.com or other video platforms
|
youtube-dl - download videos from youtube.com or other video platforms
|
||||||
|
|
||||||
- [INSTALLATION](#installation)
|
- [INSTALLATION](#installation)
|
||||||
|
@ -36,7 +36,6 @@
|
|||||||
- **AdultSwim**
|
- **AdultSwim**
|
||||||
- **aenetworks**: A+E Networks: A&E, Lifetime, History.com, FYI Network
|
- **aenetworks**: A+E Networks: A&E, Lifetime, History.com, FYI Network
|
||||||
- **afreecatv**: afreecatv.com
|
- **afreecatv**: afreecatv.com
|
||||||
- **afreecatv:global**: afreecatv.com
|
|
||||||
- **AirMozilla**
|
- **AirMozilla**
|
||||||
- **AliExpressLive**
|
- **AliExpressLive**
|
||||||
- **AlJazeera**
|
- **AlJazeera**
|
||||||
@ -130,7 +129,8 @@
|
|||||||
- **CamWithHer**
|
- **CamWithHer**
|
||||||
- **canalc2.tv**
|
- **canalc2.tv**
|
||||||
- **Canalplus**: canalplus.fr, piwiplus.fr and d8.tv
|
- **Canalplus**: canalplus.fr, piwiplus.fr and d8.tv
|
||||||
- **Canvas**: canvas.be and een.be
|
- **Canvas**
|
||||||
|
- **CanvasEen**: canvas.be and een.be
|
||||||
- **CarambaTV**
|
- **CarambaTV**
|
||||||
- **CarambaTVPage**
|
- **CarambaTVPage**
|
||||||
- **CartoonNetwork**
|
- **CartoonNetwork**
|
||||||
@ -295,6 +295,7 @@
|
|||||||
- **freespeech.org**
|
- **freespeech.org**
|
||||||
- **FreshLive**
|
- **FreshLive**
|
||||||
- **Funimation**
|
- **Funimation**
|
||||||
|
- **Funk**
|
||||||
- **FunnyOrDie**
|
- **FunnyOrDie**
|
||||||
- **Fusion**
|
- **Fusion**
|
||||||
- **Fux**
|
- **Fux**
|
||||||
@ -727,6 +728,7 @@
|
|||||||
- **SenateISVP**
|
- **SenateISVP**
|
||||||
- **SendtoNews**
|
- **SendtoNews**
|
||||||
- **ServingSys**
|
- **ServingSys**
|
||||||
|
- **Servus**
|
||||||
- **Sexu**
|
- **Sexu**
|
||||||
- **Shahid**
|
- **Shahid**
|
||||||
- **Shared**: shared.sx
|
- **Shared**: shared.sx
|
||||||
@ -737,6 +739,7 @@
|
|||||||
- **skynewsarabia:video**
|
- **skynewsarabia:video**
|
||||||
- **SkySports**
|
- **SkySports**
|
||||||
- **Slideshare**
|
- **Slideshare**
|
||||||
|
- **SlidesLive**
|
||||||
- **Slutload**
|
- **Slutload**
|
||||||
- **smotri**: Smotri.com
|
- **smotri**: Smotri.com
|
||||||
- **smotri:broadcast**: Smotri.com broadcasts
|
- **smotri:broadcast**: Smotri.com broadcasts
|
||||||
@ -885,6 +888,7 @@
|
|||||||
- **UDNEmbed**: 聯合影音
|
- **UDNEmbed**: 聯合影音
|
||||||
- **UKTVPlay**
|
- **UKTVPlay**
|
||||||
- **Unistra**
|
- **Unistra**
|
||||||
|
- **Unity**
|
||||||
- **uol.com.br**
|
- **uol.com.br**
|
||||||
- **uplynk**
|
- **uplynk**
|
||||||
- **uplynk:preplay**
|
- **uplynk:preplay**
|
||||||
@ -968,6 +972,7 @@
|
|||||||
- **VoiceRepublic**
|
- **VoiceRepublic**
|
||||||
- **Voot**
|
- **Voot**
|
||||||
- **VoxMedia**
|
- **VoxMedia**
|
||||||
|
- **VoxMediaVolume**
|
||||||
- **Vporn**
|
- **Vporn**
|
||||||
- **vpro**: npo.nl, ntr.nl, omroepwnl.nl, zapp.nl and npo3.nl
|
- **vpro**: npo.nl, ntr.nl, omroepwnl.nl, zapp.nl and npo3.nl
|
||||||
- **Vrak**
|
- **Vrak**
|
||||||
@ -1043,7 +1048,6 @@
|
|||||||
- **youtube:search**: YouTube.com searches
|
- **youtube:search**: YouTube.com searches
|
||||||
- **youtube:search:date**: YouTube.com searches, newest videos first
|
- **youtube:search:date**: YouTube.com searches, newest videos first
|
||||||
- **youtube:search_url**: YouTube.com search URLs
|
- **youtube:search_url**: YouTube.com search URLs
|
||||||
- **youtube:shared**
|
|
||||||
- **youtube:show**: YouTube.com (multi-season) shows
|
- **youtube:show**: YouTube.com (multi-season) shows
|
||||||
- **youtube:subscriptions**: YouTube.com subscriptions feed, "ytsubs" keyword (requires authentication)
|
- **youtube:subscriptions**: YouTube.com subscriptions feed, "ytsubs" keyword (requires authentication)
|
||||||
- **youtube:user**: YouTube.com user videos (URL or "ytuser" keyword)
|
- **youtube:user**: YouTube.com user videos (URL or "ytuser" keyword)
|
||||||
|
@ -466,12 +466,18 @@ class TestFormatSelection(unittest.TestCase):
|
|||||||
ydl = YDL({'simulate': True})
|
ydl = YDL({'simulate': True})
|
||||||
self.assertEqual(ydl._default_format_spec({}), 'bestvideo+bestaudio/best')
|
self.assertEqual(ydl._default_format_spec({}), 'bestvideo+bestaudio/best')
|
||||||
|
|
||||||
|
ydl = YDL({'is_live': True})
|
||||||
|
self.assertEqual(ydl._default_format_spec({}), 'best/bestvideo+bestaudio')
|
||||||
|
|
||||||
|
ydl = YDL({'simulate': True, 'is_live': True})
|
||||||
|
self.assertEqual(ydl._default_format_spec({}), 'bestvideo+bestaudio/best')
|
||||||
|
|
||||||
ydl = YDL({'outtmpl': '-'})
|
ydl = YDL({'outtmpl': '-'})
|
||||||
self.assertEqual(ydl._default_format_spec({}), 'best')
|
self.assertEqual(ydl._default_format_spec({}), 'best/bestvideo+bestaudio')
|
||||||
|
|
||||||
ydl = YDL({})
|
ydl = YDL({})
|
||||||
self.assertEqual(ydl._default_format_spec({}, download=False), 'bestvideo+bestaudio/best')
|
self.assertEqual(ydl._default_format_spec({}, download=False), 'bestvideo+bestaudio/best')
|
||||||
self.assertEqual(ydl._default_format_spec({'is_live': True}), 'best')
|
self.assertEqual(ydl._default_format_spec({'is_live': True}), 'best/bestvideo+bestaudio')
|
||||||
|
|
||||||
|
|
||||||
class TestYoutubeDL(unittest.TestCase):
|
class TestYoutubeDL(unittest.TestCase):
|
||||||
@ -770,6 +776,12 @@ class TestYoutubeDL(unittest.TestCase):
|
|||||||
result = get_ids({'playlist_items': '10'})
|
result = get_ids({'playlist_items': '10'})
|
||||||
self.assertEqual(result, [])
|
self.assertEqual(result, [])
|
||||||
|
|
||||||
|
result = get_ids({'playlist_items': '3-10'})
|
||||||
|
self.assertEqual(result, [3, 4])
|
||||||
|
|
||||||
|
result = get_ids({'playlist_items': '2-4,3-4,3'})
|
||||||
|
self.assertEqual(result, [2, 3, 4])
|
||||||
|
|
||||||
def test_urlopen_no_file_protocol(self):
|
def test_urlopen_no_file_protocol(self):
|
||||||
# see https://github.com/rg3/youtube-dl/issues/8227
|
# see https://github.com/rg3/youtube-dl/issues/8227
|
||||||
ydl = YDL()
|
ydl = YDL()
|
||||||
|
@ -65,6 +65,7 @@ from .utils import (
|
|||||||
locked_file,
|
locked_file,
|
||||||
make_HTTPS_handler,
|
make_HTTPS_handler,
|
||||||
MaxDownloadsReached,
|
MaxDownloadsReached,
|
||||||
|
orderedSet,
|
||||||
PagedList,
|
PagedList,
|
||||||
parse_filesize,
|
parse_filesize,
|
||||||
PerRequestProxyHandler,
|
PerRequestProxyHandler,
|
||||||
@ -908,15 +909,25 @@ class YoutubeDL(object):
|
|||||||
yield int(item)
|
yield int(item)
|
||||||
else:
|
else:
|
||||||
yield int(string_segment)
|
yield int(string_segment)
|
||||||
playlistitems = iter_playlistitems(playlistitems_str)
|
playlistitems = orderedSet(iter_playlistitems(playlistitems_str))
|
||||||
|
|
||||||
ie_entries = ie_result['entries']
|
ie_entries = ie_result['entries']
|
||||||
|
|
||||||
|
def make_playlistitems_entries(list_ie_entries):
|
||||||
|
num_entries = len(list_ie_entries)
|
||||||
|
return [
|
||||||
|
list_ie_entries[i - 1] for i in playlistitems
|
||||||
|
if -num_entries <= i - 1 < num_entries]
|
||||||
|
|
||||||
|
def report_download(num_entries):
|
||||||
|
self.to_screen(
|
||||||
|
'[%s] playlist %s: Downloading %d videos' %
|
||||||
|
(ie_result['extractor'], playlist, num_entries))
|
||||||
|
|
||||||
if isinstance(ie_entries, list):
|
if isinstance(ie_entries, list):
|
||||||
n_all_entries = len(ie_entries)
|
n_all_entries = len(ie_entries)
|
||||||
if playlistitems:
|
if playlistitems:
|
||||||
entries = [
|
entries = make_playlistitems_entries(ie_entries)
|
||||||
ie_entries[i - 1] for i in playlistitems
|
|
||||||
if -n_all_entries <= i - 1 < n_all_entries]
|
|
||||||
else:
|
else:
|
||||||
entries = ie_entries[playliststart:playlistend]
|
entries = ie_entries[playliststart:playlistend]
|
||||||
n_entries = len(entries)
|
n_entries = len(entries)
|
||||||
@ -934,20 +945,15 @@ class YoutubeDL(object):
|
|||||||
entries = ie_entries.getslice(
|
entries = ie_entries.getslice(
|
||||||
playliststart, playlistend)
|
playliststart, playlistend)
|
||||||
n_entries = len(entries)
|
n_entries = len(entries)
|
||||||
self.to_screen(
|
report_download(n_entries)
|
||||||
'[%s] playlist %s: Downloading %d videos' %
|
|
||||||
(ie_result['extractor'], playlist, n_entries))
|
|
||||||
else: # iterable
|
else: # iterable
|
||||||
if playlistitems:
|
if playlistitems:
|
||||||
entry_list = list(ie_entries)
|
entries = make_playlistitems_entries(list(ie_entries))
|
||||||
entries = [entry_list[i - 1] for i in playlistitems]
|
|
||||||
else:
|
else:
|
||||||
entries = list(itertools.islice(
|
entries = list(itertools.islice(
|
||||||
ie_entries, playliststart, playlistend))
|
ie_entries, playliststart, playlistend))
|
||||||
n_entries = len(entries)
|
n_entries = len(entries)
|
||||||
self.to_screen(
|
report_download(n_entries)
|
||||||
'[%s] playlist %s: Downloading %d videos' %
|
|
||||||
(ie_result['extractor'], playlist, n_entries))
|
|
||||||
|
|
||||||
if self.params.get('playlistreverse', False):
|
if self.params.get('playlistreverse', False):
|
||||||
entries = entries[::-1]
|
entries = entries[::-1]
|
||||||
@ -1072,22 +1078,27 @@ class YoutubeDL(object):
|
|||||||
return _filter
|
return _filter
|
||||||
|
|
||||||
def _default_format_spec(self, info_dict, download=True):
|
def _default_format_spec(self, info_dict, download=True):
|
||||||
req_format_list = []
|
|
||||||
|
|
||||||
def can_have_partial_formats():
|
def can_merge():
|
||||||
if self.params.get('simulate', False):
|
|
||||||
return True
|
|
||||||
if not download:
|
|
||||||
return True
|
|
||||||
if self.params.get('outtmpl', DEFAULT_OUTTMPL) == '-':
|
|
||||||
return False
|
|
||||||
if info_dict.get('is_live'):
|
|
||||||
return False
|
|
||||||
merger = FFmpegMergerPP(self)
|
merger = FFmpegMergerPP(self)
|
||||||
return merger.available and merger.can_merge()
|
return merger.available and merger.can_merge()
|
||||||
if can_have_partial_formats():
|
|
||||||
req_format_list.append('bestvideo+bestaudio')
|
def prefer_best():
|
||||||
req_format_list.append('best')
|
if self.params.get('simulate', False):
|
||||||
|
return False
|
||||||
|
if not download:
|
||||||
|
return False
|
||||||
|
if self.params.get('outtmpl', DEFAULT_OUTTMPL) == '-':
|
||||||
|
return True
|
||||||
|
if info_dict.get('is_live'):
|
||||||
|
return True
|
||||||
|
if not can_merge():
|
||||||
|
return True
|
||||||
|
return False
|
||||||
|
|
||||||
|
req_format_list = ['bestvideo+bestaudio', 'best']
|
||||||
|
if prefer_best():
|
||||||
|
req_format_list.reverse()
|
||||||
return '/'.join(req_format_list)
|
return '/'.join(req_format_list)
|
||||||
|
|
||||||
def build_format_selector(self, format_spec):
|
def build_format_selector(self, format_spec):
|
||||||
|
@ -117,9 +117,15 @@ class FragmentFD(FileDownloader):
|
|||||||
def _prepare_frag_download(self, ctx):
|
def _prepare_frag_download(self, ctx):
|
||||||
if 'live' not in ctx:
|
if 'live' not in ctx:
|
||||||
ctx['live'] = False
|
ctx['live'] = False
|
||||||
|
if not ctx['live']:
|
||||||
|
total_frags_str = '%d' % ctx['total_frags']
|
||||||
|
ad_frags = ctx.get('ad_frags', 0)
|
||||||
|
if ad_frags:
|
||||||
|
total_frags_str += ' (not including %d ad)' % ad_frags
|
||||||
|
else:
|
||||||
|
total_frags_str = 'unknown (live)'
|
||||||
self.to_screen(
|
self.to_screen(
|
||||||
'[%s] Total fragments: %s'
|
'[%s] Total fragments: %s' % (self.FD_NAME, total_frags_str))
|
||||||
% (self.FD_NAME, ctx['total_frags'] if not ctx['live'] else 'unknown (live)'))
|
|
||||||
self.report_destination(ctx['filename'])
|
self.report_destination(ctx['filename'])
|
||||||
dl = HttpQuietDownloader(
|
dl = HttpQuietDownloader(
|
||||||
self.ydl,
|
self.ydl,
|
||||||
@ -152,7 +158,7 @@ class FragmentFD(FileDownloader):
|
|||||||
if os.path.isfile(encodeFilename(self.ytdl_filename(ctx['filename']))):
|
if os.path.isfile(encodeFilename(self.ytdl_filename(ctx['filename']))):
|
||||||
self._read_ytdl_file(ctx)
|
self._read_ytdl_file(ctx)
|
||||||
if ctx['fragment_index'] > 0 and resume_len == 0:
|
if ctx['fragment_index'] > 0 and resume_len == 0:
|
||||||
self.report_error(
|
self.report_warning(
|
||||||
'Inconsistent state of incomplete fragment download. '
|
'Inconsistent state of incomplete fragment download. '
|
||||||
'Restarting from the beginning...')
|
'Restarting from the beginning...')
|
||||||
ctx['fragment_index'] = resume_len = 0
|
ctx['fragment_index'] = resume_len = 0
|
||||||
|
@ -75,15 +75,30 @@ class HlsFD(FragmentFD):
|
|||||||
fd.add_progress_hook(ph)
|
fd.add_progress_hook(ph)
|
||||||
return fd.real_download(filename, info_dict)
|
return fd.real_download(filename, info_dict)
|
||||||
|
|
||||||
total_frags = 0
|
def anvato_ad(s):
|
||||||
|
return s.startswith('#ANVATO-SEGMENT-INFO') and 'type=ad' in s
|
||||||
|
|
||||||
|
media_frags = 0
|
||||||
|
ad_frags = 0
|
||||||
|
ad_frag_next = False
|
||||||
for line in s.splitlines():
|
for line in s.splitlines():
|
||||||
line = line.strip()
|
line = line.strip()
|
||||||
if line and not line.startswith('#'):
|
if not line:
|
||||||
total_frags += 1
|
continue
|
||||||
|
if line.startswith('#'):
|
||||||
|
if anvato_ad(line):
|
||||||
|
ad_frags += 1
|
||||||
|
ad_frag_next = True
|
||||||
|
continue
|
||||||
|
if ad_frag_next:
|
||||||
|
ad_frag_next = False
|
||||||
|
continue
|
||||||
|
media_frags += 1
|
||||||
|
|
||||||
ctx = {
|
ctx = {
|
||||||
'filename': filename,
|
'filename': filename,
|
||||||
'total_frags': total_frags,
|
'total_frags': media_frags,
|
||||||
|
'ad_frags': ad_frags,
|
||||||
}
|
}
|
||||||
|
|
||||||
self._prepare_and_start_frag_download(ctx)
|
self._prepare_and_start_frag_download(ctx)
|
||||||
@ -101,10 +116,14 @@ class HlsFD(FragmentFD):
|
|||||||
decrypt_info = {'METHOD': 'NONE'}
|
decrypt_info = {'METHOD': 'NONE'}
|
||||||
byte_range = {}
|
byte_range = {}
|
||||||
frag_index = 0
|
frag_index = 0
|
||||||
|
ad_frag_next = False
|
||||||
for line in s.splitlines():
|
for line in s.splitlines():
|
||||||
line = line.strip()
|
line = line.strip()
|
||||||
if line:
|
if line:
|
||||||
if not line.startswith('#'):
|
if not line.startswith('#'):
|
||||||
|
if ad_frag_next:
|
||||||
|
ad_frag_next = False
|
||||||
|
continue
|
||||||
frag_index += 1
|
frag_index += 1
|
||||||
if frag_index <= ctx['fragment_index']:
|
if frag_index <= ctx['fragment_index']:
|
||||||
continue
|
continue
|
||||||
@ -175,6 +194,8 @@ class HlsFD(FragmentFD):
|
|||||||
'start': sub_range_start,
|
'start': sub_range_start,
|
||||||
'end': sub_range_start + int(splitted_byte_range[0]),
|
'end': sub_range_start + int(splitted_byte_range[0]),
|
||||||
}
|
}
|
||||||
|
elif anvato_ad(line):
|
||||||
|
ad_frag_next = True
|
||||||
|
|
||||||
self._finish_frag_download(ctx)
|
self._finish_frag_download(ctx)
|
||||||
|
|
||||||
|
@ -131,7 +131,7 @@ class AENetworksIE(AENetworksBaseIE):
|
|||||||
r'data-media-url=(["\'])(?P<url>(?:(?!\1).)+?)\1'],
|
r'data-media-url=(["\'])(?P<url>(?:(?!\1).)+?)\1'],
|
||||||
webpage, 'video url', group='url')
|
webpage, 'video url', group='url')
|
||||||
theplatform_metadata = self._download_theplatform_metadata(self._search_regex(
|
theplatform_metadata = self._download_theplatform_metadata(self._search_regex(
|
||||||
r'https?://link.theplatform.com/s/([^?]+)', media_url, 'theplatform_path'), video_id)
|
r'https?://link\.theplatform\.com/s/([^?]+)', media_url, 'theplatform_path'), video_id)
|
||||||
info = self._parse_theplatform_metadata(theplatform_metadata)
|
info = self._parse_theplatform_metadata(theplatform_metadata)
|
||||||
if theplatform_metadata.get('AETN$isBehindWall'):
|
if theplatform_metadata.get('AETN$isBehindWall'):
|
||||||
requestor_id = self._DOMAIN_TO_REQUESTOR_ID[domain]
|
requestor_id = self._DOMAIN_TO_REQUESTOR_ID[domain]
|
||||||
|
@ -271,107 +271,3 @@ class AfreecaTVIE(InfoExtractor):
|
|||||||
})
|
})
|
||||||
|
|
||||||
return info
|
return info
|
||||||
|
|
||||||
|
|
||||||
class AfreecaTVGlobalIE(AfreecaTVIE):
|
|
||||||
IE_NAME = 'afreecatv:global'
|
|
||||||
_VALID_URL = r'https?://(?:www\.)?afreeca\.tv/(?P<channel_id>\d+)(?:/v/(?P<video_id>\d+))?'
|
|
||||||
_TESTS = [{
|
|
||||||
'url': 'http://afreeca.tv/36853014/v/58301',
|
|
||||||
'info_dict': {
|
|
||||||
'id': '58301',
|
|
||||||
'title': 'tryhard top100',
|
|
||||||
'uploader_id': '36853014',
|
|
||||||
'uploader': 'makgi Hearthstone Live!',
|
|
||||||
},
|
|
||||||
'playlist_count': 3,
|
|
||||||
}]
|
|
||||||
|
|
||||||
def _real_extract(self, url):
|
|
||||||
channel_id, video_id = re.match(self._VALID_URL, url).groups()
|
|
||||||
video_type = 'video' if video_id else 'live'
|
|
||||||
query = {
|
|
||||||
'pt': 'view',
|
|
||||||
'bid': channel_id,
|
|
||||||
}
|
|
||||||
if video_id:
|
|
||||||
query['vno'] = video_id
|
|
||||||
video_data = self._download_json(
|
|
||||||
'http://api.afreeca.tv/%s/view_%s.php' % (video_type, video_type),
|
|
||||||
video_id or channel_id, query=query)['channel']
|
|
||||||
|
|
||||||
if video_data.get('result') != 1:
|
|
||||||
raise ExtractorError('%s said: %s' % (self.IE_NAME, video_data['remsg']))
|
|
||||||
|
|
||||||
title = video_data['title']
|
|
||||||
|
|
||||||
info = {
|
|
||||||
'thumbnail': video_data.get('thumb'),
|
|
||||||
'view_count': int_or_none(video_data.get('vcnt')),
|
|
||||||
'age_limit': int_or_none(video_data.get('grade')),
|
|
||||||
'uploader_id': channel_id,
|
|
||||||
'uploader': video_data.get('cname'),
|
|
||||||
}
|
|
||||||
|
|
||||||
if video_id:
|
|
||||||
entries = []
|
|
||||||
for i, f in enumerate(video_data.get('flist', [])):
|
|
||||||
video_key = self.parse_video_key(f.get('key', ''))
|
|
||||||
f_url = f.get('file')
|
|
||||||
if not video_key or not f_url:
|
|
||||||
continue
|
|
||||||
entries.append({
|
|
||||||
'id': '%s_%s' % (video_id, video_key.get('part', i + 1)),
|
|
||||||
'title': title,
|
|
||||||
'upload_date': video_key.get('upload_date'),
|
|
||||||
'duration': int_or_none(f.get('length')),
|
|
||||||
'url': f_url,
|
|
||||||
'protocol': 'm3u8_native',
|
|
||||||
'ext': 'mp4',
|
|
||||||
})
|
|
||||||
|
|
||||||
info.update({
|
|
||||||
'id': video_id,
|
|
||||||
'title': title,
|
|
||||||
'duration': int_or_none(video_data.get('length')),
|
|
||||||
})
|
|
||||||
if len(entries) > 1:
|
|
||||||
info['_type'] = 'multi_video'
|
|
||||||
info['entries'] = entries
|
|
||||||
elif len(entries) == 1:
|
|
||||||
i = entries[0].copy()
|
|
||||||
i.update(info)
|
|
||||||
info = i
|
|
||||||
else:
|
|
||||||
formats = []
|
|
||||||
for s in video_data.get('strm', []):
|
|
||||||
s_url = s.get('purl')
|
|
||||||
if not s_url:
|
|
||||||
continue
|
|
||||||
stype = s.get('stype')
|
|
||||||
if stype == 'HLS':
|
|
||||||
formats.extend(self._extract_m3u8_formats(
|
|
||||||
s_url, channel_id, 'mp4', m3u8_id=stype, fatal=False))
|
|
||||||
elif stype == 'RTMP':
|
|
||||||
format_id = [stype]
|
|
||||||
label = s.get('label')
|
|
||||||
if label:
|
|
||||||
format_id.append(label)
|
|
||||||
formats.append({
|
|
||||||
'format_id': '-'.join(format_id),
|
|
||||||
'url': s_url,
|
|
||||||
'tbr': int_or_none(s.get('bps')),
|
|
||||||
'height': int_or_none(s.get('brt')),
|
|
||||||
'ext': 'flv',
|
|
||||||
'rtmp_live': True,
|
|
||||||
})
|
|
||||||
self._sort_formats(formats)
|
|
||||||
|
|
||||||
info.update({
|
|
||||||
'id': channel_id,
|
|
||||||
'title': self._live_title(title),
|
|
||||||
'is_live': True,
|
|
||||||
'formats': formats,
|
|
||||||
})
|
|
||||||
|
|
||||||
return info
|
|
||||||
|
@ -18,6 +18,7 @@ from ..utils import (
|
|||||||
int_or_none,
|
int_or_none,
|
||||||
strip_jsonp,
|
strip_jsonp,
|
||||||
unescapeHTML,
|
unescapeHTML,
|
||||||
|
unsmuggle_url,
|
||||||
)
|
)
|
||||||
|
|
||||||
|
|
||||||
@ -197,12 +198,16 @@ class AnvatoIE(InfoExtractor):
|
|||||||
'tbr': tbr if tbr != 0 else None,
|
'tbr': tbr if tbr != 0 else None,
|
||||||
}
|
}
|
||||||
|
|
||||||
if ext == 'm3u8' or media_format in ('m3u8', 'm3u8-variant'):
|
if media_format == 'm3u8' and tbr is not None:
|
||||||
if tbr is not None:
|
|
||||||
a_format.update({
|
a_format.update({
|
||||||
'format_id': '-'.join(filter(None, ['hls', compat_str(tbr)])),
|
'format_id': '-'.join(filter(None, ['hls', compat_str(tbr)])),
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
})
|
})
|
||||||
|
elif media_format == 'm3u8-variant' or ext == 'm3u8':
|
||||||
|
formats.extend(self._extract_m3u8_formats(
|
||||||
|
video_url, video_id, 'mp4', entry_protocol='m3u8_native',
|
||||||
|
m3u8_id='hls', fatal=False))
|
||||||
|
continue
|
||||||
elif ext == 'mp3' or media_format == 'mp3':
|
elif ext == 'mp3' or media_format == 'mp3':
|
||||||
a_format['vcodec'] = 'none'
|
a_format['vcodec'] = 'none'
|
||||||
else:
|
else:
|
||||||
@ -271,6 +276,9 @@ class AnvatoIE(InfoExtractor):
|
|||||||
anvplayer_data['accessKey'], anvplayer_data['video'])
|
anvplayer_data['accessKey'], anvplayer_data['video'])
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
|
url, smuggled_data = unsmuggle_url(url, {})
|
||||||
|
self._initialize_geo_bypass(smuggled_data.get('geo_countries'))
|
||||||
|
|
||||||
mobj = re.match(self._VALID_URL, url)
|
mobj = re.match(self._VALID_URL, url)
|
||||||
access_key, video_id = mobj.group('access_key_or_mcp', 'id')
|
access_key, video_id = mobj.group('access_key_or_mcp', 'id')
|
||||||
if access_key not in self._ANVACK_TABLE:
|
if access_key not in self._ANVACK_TABLE:
|
||||||
|
@ -117,7 +117,7 @@ class AppleTrailersIE(InfoExtractor):
|
|||||||
continue
|
continue
|
||||||
formats.append({
|
formats.append({
|
||||||
'format_id': '%s-%s' % (version, size),
|
'format_id': '%s-%s' % (version, size),
|
||||||
'url': re.sub(r'_(\d+p.mov)', r'_h\1', src),
|
'url': re.sub(r'_(\d+p\.mov)', r'_h\1', src),
|
||||||
'width': int_or_none(size_data.get('width')),
|
'width': int_or_none(size_data.get('width')),
|
||||||
'height': int_or_none(size_data.get('height')),
|
'height': int_or_none(size_data.get('height')),
|
||||||
'language': version[:2],
|
'language': version[:2],
|
||||||
@ -179,7 +179,7 @@ class AppleTrailersIE(InfoExtractor):
|
|||||||
formats = []
|
formats = []
|
||||||
for format in settings['metadata']['sizes']:
|
for format in settings['metadata']['sizes']:
|
||||||
# The src is a file pointing to the real video file
|
# The src is a file pointing to the real video file
|
||||||
format_url = re.sub(r'_(\d*p.mov)', r'_h\1', format['src'])
|
format_url = re.sub(r'_(\d*p\.mov)', r'_h\1', format['src'])
|
||||||
formats.append({
|
formats.append({
|
||||||
'url': format_url,
|
'url': format_url,
|
||||||
'format': format['type'],
|
'format': format['type'],
|
||||||
|
@ -195,7 +195,7 @@ class ARDMediathekIE(InfoExtractor):
|
|||||||
|
|
||||||
title = self._html_search_regex(
|
title = self._html_search_regex(
|
||||||
[r'<h1(?:\s+class="boxTopHeadline")?>(.*?)</h1>',
|
[r'<h1(?:\s+class="boxTopHeadline")?>(.*?)</h1>',
|
||||||
r'<meta name="dcterms.title" content="(.*?)"/>',
|
r'<meta name="dcterms\.title" content="(.*?)"/>',
|
||||||
r'<h4 class="headline">(.*?)</h4>'],
|
r'<h4 class="headline">(.*?)</h4>'],
|
||||||
webpage, 'title')
|
webpage, 'title')
|
||||||
description = self._html_search_meta(
|
description = self._html_search_meta(
|
||||||
|
@ -6,6 +6,7 @@ import re
|
|||||||
from .common import InfoExtractor
|
from .common import InfoExtractor
|
||||||
from ..compat import (
|
from ..compat import (
|
||||||
compat_parse_qs,
|
compat_parse_qs,
|
||||||
|
compat_str,
|
||||||
compat_urllib_parse_urlparse,
|
compat_urllib_parse_urlparse,
|
||||||
)
|
)
|
||||||
from ..utils import (
|
from ..utils import (
|
||||||
@ -15,6 +16,7 @@ from ..utils import (
|
|||||||
int_or_none,
|
int_or_none,
|
||||||
NO_DEFAULT,
|
NO_DEFAULT,
|
||||||
qualities,
|
qualities,
|
||||||
|
try_get,
|
||||||
unified_strdate,
|
unified_strdate,
|
||||||
)
|
)
|
||||||
|
|
||||||
@ -80,12 +82,15 @@ class ArteTVBaseIE(InfoExtractor):
|
|||||||
info = self._download_json(json_url, video_id)
|
info = self._download_json(json_url, video_id)
|
||||||
player_info = info['videoJsonPlayer']
|
player_info = info['videoJsonPlayer']
|
||||||
|
|
||||||
vsr = player_info['VSR']
|
vsr = try_get(player_info, lambda x: x['VSR'], dict)
|
||||||
|
|
||||||
if not vsr:
|
if not vsr:
|
||||||
raise ExtractorError(
|
error = None
|
||||||
'Video %s is not available' % player_info.get('VID') or video_id,
|
if try_get(player_info, lambda x: x['custom_msg']['type']) == 'error':
|
||||||
expected=True)
|
error = try_get(
|
||||||
|
player_info, lambda x: x['custom_msg']['msg'], compat_str)
|
||||||
|
if not error:
|
||||||
|
error = 'Video %s is not available' % player_info.get('VID') or video_id
|
||||||
|
raise ExtractorError(error, expected=True)
|
||||||
|
|
||||||
upload_date_str = player_info.get('shootingDate')
|
upload_date_str = player_info.get('shootingDate')
|
||||||
if not upload_date_str:
|
if not upload_date_str:
|
||||||
|
@ -47,7 +47,7 @@ class AZMedienIE(AZMedienBaseIE):
|
|||||||
'url': 'http://www.telezueri.ch/62-show-zuerinews/13772-episode-sonntag-18-dezember-2016/32419-segment-massenabweisungen-beim-hiltl-club-wegen-pelzboom',
|
'url': 'http://www.telezueri.ch/62-show-zuerinews/13772-episode-sonntag-18-dezember-2016/32419-segment-massenabweisungen-beim-hiltl-club-wegen-pelzboom',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '1_2444peh4',
|
'id': '1_2444peh4',
|
||||||
'ext': 'mov',
|
'ext': 'mp4',
|
||||||
'title': 'Massenabweisungen beim Hiltl Club wegen Pelzboom',
|
'title': 'Massenabweisungen beim Hiltl Club wegen Pelzboom',
|
||||||
'description': 'md5:9ea9dd1b159ad65b36ddcf7f0d7c76a8',
|
'description': 'md5:9ea9dd1b159ad65b36ddcf7f0d7c76a8',
|
||||||
'uploader_id': 'TeleZ?ri',
|
'uploader_id': 'TeleZ?ri',
|
||||||
|
@ -386,7 +386,7 @@ class BBCCoUkIE(InfoExtractor):
|
|||||||
m3u8_id=format_id, fatal=False))
|
m3u8_id=format_id, fatal=False))
|
||||||
if re.search(self._USP_RE, href):
|
if re.search(self._USP_RE, href):
|
||||||
usp_formats = self._extract_m3u8_formats(
|
usp_formats = self._extract_m3u8_formats(
|
||||||
re.sub(self._USP_RE, r'/\1.ism/\1.m3u8', href),
|
re.sub(self._USP_RE, r'/\1\.ism/\1\.m3u8', href),
|
||||||
programme_id, ext='mp4', entry_protocol='m3u8_native',
|
programme_id, ext='mp4', entry_protocol='m3u8_native',
|
||||||
m3u8_id=format_id, fatal=False)
|
m3u8_id=format_id, fatal=False)
|
||||||
for f in usp_formats:
|
for f in usp_formats:
|
||||||
|
@ -1,16 +1,22 @@
|
|||||||
from __future__ import unicode_literals
|
from __future__ import unicode_literals
|
||||||
|
|
||||||
import re
|
import re
|
||||||
|
import json
|
||||||
|
|
||||||
from .common import InfoExtractor
|
from .common import InfoExtractor
|
||||||
|
from .gigya import GigyaBaseIE
|
||||||
|
from ..compat import compat_HTTPError
|
||||||
from ..utils import (
|
from ..utils import (
|
||||||
float_or_none,
|
ExtractorError,
|
||||||
strip_or_none,
|
strip_or_none,
|
||||||
|
float_or_none,
|
||||||
|
int_or_none,
|
||||||
|
parse_iso8601,
|
||||||
)
|
)
|
||||||
|
|
||||||
|
|
||||||
class CanvasIE(InfoExtractor):
|
class CanvasIE(InfoExtractor):
|
||||||
_VALID_URL = r'https?://mediazone\.vrt\.be/api/v1/(?P<site_id>canvas|een|ketnet)/assets/(?P<id>m[dz]-ast-[^/?#&]+)'
|
_VALID_URL = r'https?://mediazone\.vrt\.be/api/v1/(?P<site_id>canvas|een|ketnet|vrtvideo)/assets/(?P<id>[^/?#&]+)'
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
'url': 'https://mediazone.vrt.be/api/v1/ketnet/assets/md-ast-4ac54990-ce66-4d00-a8ca-9eac86f4c475',
|
'url': 'https://mediazone.vrt.be/api/v1/ketnet/assets/md-ast-4ac54990-ce66-4d00-a8ca-9eac86f4c475',
|
||||||
'md5': '90139b746a0a9bd7bb631283f6e2a64e',
|
'md5': '90139b746a0a9bd7bb631283f6e2a64e',
|
||||||
@ -166,3 +172,139 @@ class CanvasEenIE(InfoExtractor):
|
|||||||
'title': title,
|
'title': title,
|
||||||
'description': self._og_search_description(webpage),
|
'description': self._og_search_description(webpage),
|
||||||
}
|
}
|
||||||
|
|
||||||
|
|
||||||
|
class VrtNUIE(GigyaBaseIE):
|
||||||
|
IE_DESC = 'VrtNU.be'
|
||||||
|
_VALID_URL = r'https?://(?:www\.)?vrt\.be/(?P<site_id>vrtnu)/(?:[^/]+/)*(?P<id>[^/?#&]+)'
|
||||||
|
_TESTS = [{
|
||||||
|
'url': 'https://www.vrt.be/vrtnu/a-z/postbus-x/1/postbus-x-s1a1/',
|
||||||
|
'info_dict': {
|
||||||
|
'id': 'pbs-pub-2e2d8c27-df26-45c9-9dc6-90c78153044d$vid-90c932b1-e21d-4fb8-99b1-db7b49cf74de',
|
||||||
|
'ext': 'flv',
|
||||||
|
'title': 'De zwarte weduwe',
|
||||||
|
'description': 'md5:d90c21dced7db869a85db89a623998d4',
|
||||||
|
'duration': 1457.04,
|
||||||
|
'thumbnail': r're:^https?://.*\.jpg$',
|
||||||
|
'season': '1',
|
||||||
|
'season_number': 1,
|
||||||
|
'episode_number': 1,
|
||||||
|
},
|
||||||
|
'skip': 'This video is only available for registered users'
|
||||||
|
}]
|
||||||
|
_NETRC_MACHINE = 'vrtnu'
|
||||||
|
_APIKEY = '3_0Z2HujMtiWq_pkAjgnS2Md2E11a1AwZjYiBETtwNE-EoEHDINgtnvcAOpNgmrVGy'
|
||||||
|
_CONTEXT_ID = 'R3595707040'
|
||||||
|
|
||||||
|
def _real_initialize(self):
|
||||||
|
self._login()
|
||||||
|
|
||||||
|
def _login(self):
|
||||||
|
username, password = self._get_login_info()
|
||||||
|
if username is None:
|
||||||
|
return
|
||||||
|
|
||||||
|
auth_data = {
|
||||||
|
'APIKey': self._APIKEY,
|
||||||
|
'targetEnv': 'jssdk',
|
||||||
|
'loginID': username,
|
||||||
|
'password': password,
|
||||||
|
'authMode': 'cookie',
|
||||||
|
}
|
||||||
|
|
||||||
|
auth_info = self._gigya_login(auth_data)
|
||||||
|
|
||||||
|
# Sometimes authentication fails for no good reason, retry
|
||||||
|
login_attempt = 1
|
||||||
|
while login_attempt <= 3:
|
||||||
|
try:
|
||||||
|
# When requesting a token, no actual token is returned, but the
|
||||||
|
# necessary cookies are set.
|
||||||
|
self._request_webpage(
|
||||||
|
'https://token.vrt.be',
|
||||||
|
None, note='Requesting a token', errnote='Could not get a token',
|
||||||
|
headers={
|
||||||
|
'Content-Type': 'application/json',
|
||||||
|
'Referer': 'https://www.vrt.be/vrtnu/',
|
||||||
|
},
|
||||||
|
data=json.dumps({
|
||||||
|
'uid': auth_info['UID'],
|
||||||
|
'uidsig': auth_info['UIDSignature'],
|
||||||
|
'ts': auth_info['signatureTimestamp'],
|
||||||
|
'email': auth_info['profile']['email'],
|
||||||
|
}).encode('utf-8'))
|
||||||
|
except ExtractorError as e:
|
||||||
|
if isinstance(e.cause, compat_HTTPError) and e.cause.code == 401:
|
||||||
|
login_attempt += 1
|
||||||
|
self.report_warning('Authentication failed')
|
||||||
|
self._sleep(1, None, msg_template='Waiting for %(timeout)s seconds before trying again')
|
||||||
|
else:
|
||||||
|
raise e
|
||||||
|
else:
|
||||||
|
break
|
||||||
|
|
||||||
|
def _real_extract(self, url):
|
||||||
|
display_id = self._match_id(url)
|
||||||
|
|
||||||
|
webpage = self._download_webpage(url, display_id)
|
||||||
|
|
||||||
|
title = self._html_search_regex(
|
||||||
|
r'(?ms)<h1 class="content__heading">(.+?)</h1>',
|
||||||
|
webpage, 'title').strip()
|
||||||
|
|
||||||
|
description = self._html_search_regex(
|
||||||
|
r'(?ms)<div class="content__description">(.+?)</div>',
|
||||||
|
webpage, 'description', default=None)
|
||||||
|
|
||||||
|
season = self._html_search_regex(
|
||||||
|
[r'''(?xms)<div\ class="tabs__tab\ tabs__tab--active">\s*
|
||||||
|
<span>seizoen\ (.+?)</span>\s*
|
||||||
|
</div>''',
|
||||||
|
r'<option value="seizoen (\d{1,3})" data-href="[^"]+?" selected>'],
|
||||||
|
webpage, 'season', default=None)
|
||||||
|
|
||||||
|
season_number = int_or_none(season)
|
||||||
|
|
||||||
|
episode_number = int_or_none(self._html_search_regex(
|
||||||
|
r'''(?xms)<div\ class="content__episode">\s*
|
||||||
|
<abbr\ title="aflevering">afl</abbr>\s*<span>(\d+)</span>
|
||||||
|
</div>''',
|
||||||
|
webpage, 'episode_number', default=None))
|
||||||
|
|
||||||
|
release_date = parse_iso8601(self._html_search_regex(
|
||||||
|
r'(?ms)<div class="content__broadcastdate">\s*<time\ datetime="(.+?)"',
|
||||||
|
webpage, 'release_date', default=None))
|
||||||
|
|
||||||
|
# If there's a ? or a # in the URL, remove them and everything after
|
||||||
|
clean_url = url.split('?')[0].split('#')[0].strip('/')
|
||||||
|
securevideo_url = clean_url + '.mssecurevideo.json'
|
||||||
|
|
||||||
|
try:
|
||||||
|
video = self._download_json(securevideo_url, display_id)
|
||||||
|
except ExtractorError as e:
|
||||||
|
if isinstance(e.cause, compat_HTTPError) and e.cause.code == 401:
|
||||||
|
self.raise_login_required()
|
||||||
|
raise
|
||||||
|
|
||||||
|
# We are dealing with a '../<show>.relevant' URL
|
||||||
|
redirect_url = video.get('url')
|
||||||
|
if redirect_url:
|
||||||
|
return self.url_result(self._proto_relative_url(redirect_url, 'https:'))
|
||||||
|
|
||||||
|
# There is only one entry, but with an unknown key, so just get
|
||||||
|
# the first one
|
||||||
|
video_id = list(video.values())[0].get('videoid')
|
||||||
|
|
||||||
|
return {
|
||||||
|
'_type': 'url_transparent',
|
||||||
|
'url': 'https://mediazone.vrt.be/api/v1/vrtvideo/assets/%s' % video_id,
|
||||||
|
'ie_key': CanvasIE.ie_key(),
|
||||||
|
'id': video_id,
|
||||||
|
'display_id': display_id,
|
||||||
|
'title': title,
|
||||||
|
'description': description,
|
||||||
|
'season': season,
|
||||||
|
'season_number': season_number,
|
||||||
|
'episode_number': episode_number,
|
||||||
|
'release_date': release_date,
|
||||||
|
}
|
||||||
|
@ -81,6 +81,12 @@ class Channel9IE(InfoExtractor):
|
|||||||
|
|
||||||
_RSS_URL = 'http://channel9.msdn.com/%s/RSS'
|
_RSS_URL = 'http://channel9.msdn.com/%s/RSS'
|
||||||
|
|
||||||
|
@staticmethod
|
||||||
|
def _extract_urls(webpage):
|
||||||
|
return re.findall(
|
||||||
|
r'<iframe[^>]+src=["\'](https?://channel9\.msdn\.com/(?:[^/]+/)+)player\b',
|
||||||
|
webpage)
|
||||||
|
|
||||||
def _extract_list(self, video_id, rss_url=None):
|
def _extract_list(self, video_id, rss_url=None):
|
||||||
if not rss_url:
|
if not rss_url:
|
||||||
rss_url = self._RSS_URL % video_id
|
rss_url = self._RSS_URL % video_id
|
||||||
|
@ -2338,7 +2338,6 @@ class InfoExtractor(object):
|
|||||||
formats = self._parse_jwplayer_formats(
|
formats = self._parse_jwplayer_formats(
|
||||||
video_data['sources'], video_id=this_video_id, m3u8_id=m3u8_id,
|
video_data['sources'], video_id=this_video_id, m3u8_id=m3u8_id,
|
||||||
mpd_id=mpd_id, rtmp_params=rtmp_params, base_url=base_url)
|
mpd_id=mpd_id, rtmp_params=rtmp_params, base_url=base_url)
|
||||||
self._sort_formats(formats)
|
|
||||||
|
|
||||||
subtitles = {}
|
subtitles = {}
|
||||||
tracks = video_data.get('tracks')
|
tracks = video_data.get('tracks')
|
||||||
@ -2355,16 +2354,25 @@ class InfoExtractor(object):
|
|||||||
'url': self._proto_relative_url(track_url)
|
'url': self._proto_relative_url(track_url)
|
||||||
})
|
})
|
||||||
|
|
||||||
entries.append({
|
entry = {
|
||||||
'id': this_video_id,
|
'id': this_video_id,
|
||||||
'title': video_data['title'] if require_title else video_data.get('title'),
|
'title': unescapeHTML(video_data['title'] if require_title else video_data.get('title')),
|
||||||
'description': video_data.get('description'),
|
'description': video_data.get('description'),
|
||||||
'thumbnail': self._proto_relative_url(video_data.get('image')),
|
'thumbnail': self._proto_relative_url(video_data.get('image')),
|
||||||
'timestamp': int_or_none(video_data.get('pubdate')),
|
'timestamp': int_or_none(video_data.get('pubdate')),
|
||||||
'duration': float_or_none(jwplayer_data.get('duration') or video_data.get('duration')),
|
'duration': float_or_none(jwplayer_data.get('duration') or video_data.get('duration')),
|
||||||
'subtitles': subtitles,
|
'subtitles': subtitles,
|
||||||
'formats': formats,
|
}
|
||||||
|
# https://github.com/jwplayer/jwplayer/blob/master/src/js/utils/validator.js#L32
|
||||||
|
if len(formats) == 1 and re.search(r'^(?:http|//).*(?:youtube\.com|youtu\.be)/.+', formats[0]['url']):
|
||||||
|
entry.update({
|
||||||
|
'_type': 'url_transparent',
|
||||||
|
'url': formats[0]['url'],
|
||||||
})
|
})
|
||||||
|
else:
|
||||||
|
self._sort_formats(formats)
|
||||||
|
entry['formats'] = formats
|
||||||
|
entries.append(entry)
|
||||||
if len(entries) == 1:
|
if len(entries) == 1:
|
||||||
return entries[0]
|
return entries[0]
|
||||||
else:
|
else:
|
||||||
|
@ -235,7 +235,7 @@ class DailymotionIE(DailymotionBaseInfoExtractor):
|
|||||||
|
|
||||||
# vevo embed
|
# vevo embed
|
||||||
vevo_id = self._search_regex(
|
vevo_id = self._search_regex(
|
||||||
r'<link rel="video_src" href="[^"]*?vevo.com[^"]*?video=(?P<id>[\w]*)',
|
r'<link rel="video_src" href="[^"]*?vevo\.com[^"]*?video=(?P<id>[\w]*)',
|
||||||
webpage, 'vevo embed', default=None)
|
webpage, 'vevo embed', default=None)
|
||||||
if vevo_id:
|
if vevo_id:
|
||||||
return self.url_result('vevo:%s' % vevo_id, 'Vevo')
|
return self.url_result('vevo:%s' % vevo_id, 'Vevo')
|
||||||
|
@ -19,7 +19,7 @@ class DeezerPlaylistIE(InfoExtractor):
|
|||||||
'id': '176747451',
|
'id': '176747451',
|
||||||
'title': 'Best!',
|
'title': 'Best!',
|
||||||
'uploader': 'Anonymous',
|
'uploader': 'Anonymous',
|
||||||
'thumbnail': r're:^https?://cdn-images.deezer.com/images/cover/.*\.jpg$',
|
'thumbnail': r're:^https?://cdn-images\.deezer\.com/images/cover/.*\.jpg$',
|
||||||
},
|
},
|
||||||
'playlist_count': 30,
|
'playlist_count': 30,
|
||||||
'skip': 'Only available in .de',
|
'skip': 'Only available in .de',
|
||||||
|
@ -59,7 +59,7 @@ class DramaFeverBaseIE(AMPIE):
|
|||||||
if all(logout_pattern not in response
|
if all(logout_pattern not in response
|
||||||
for logout_pattern in ['href="/accounts/logout/"', '>Log out<']):
|
for logout_pattern in ['href="/accounts/logout/"', '>Log out<']):
|
||||||
error = self._html_search_regex(
|
error = self._html_search_regex(
|
||||||
r'(?s)class="hidden-xs prompt"[^>]*>(.+?)<',
|
r'(?s)<h\d[^>]+\bclass="hidden-xs prompt"[^>]*>(.+?)</h\d',
|
||||||
response, 'error message', default=None)
|
response, 'error message', default=None)
|
||||||
if error:
|
if error:
|
||||||
raise ExtractorError('Unable to login: %s' % error, expected=True)
|
raise ExtractorError('Unable to login: %s' % error, expected=True)
|
||||||
|
@ -138,6 +138,7 @@ class DRTVIE(InfoExtractor):
|
|||||||
'tbr': int_or_none(bitrate),
|
'tbr': int_or_none(bitrate),
|
||||||
'ext': link.get('FileFormat'),
|
'ext': link.get('FileFormat'),
|
||||||
'vcodec': 'none' if kind == 'AudioResource' else None,
|
'vcodec': 'none' if kind == 'AudioResource' else None,
|
||||||
|
'preference': preference,
|
||||||
})
|
})
|
||||||
subtitles_list = asset.get('SubtitlesList')
|
subtitles_list = asset.get('SubtitlesList')
|
||||||
if isinstance(subtitles_list, list):
|
if isinstance(subtitles_list, list):
|
||||||
|
@ -15,7 +15,7 @@ from ..utils import (
|
|||||||
|
|
||||||
|
|
||||||
class EpornerIE(InfoExtractor):
|
class EpornerIE(InfoExtractor):
|
||||||
_VALID_URL = r'https?://(?:www\.)?eporner\.com/hd-porn/(?P<id>\w+)(?:/(?P<display_id>[\w-]+))?'
|
_VALID_URL = r'https?://(?:www\.)?eporner\.com/(?:hd-porn|embed)/(?P<id>\w+)(?:/(?P<display_id>[\w-]+))?'
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
'url': 'http://www.eporner.com/hd-porn/95008/Infamous-Tiffany-Teen-Strip-Tease-Video/',
|
'url': 'http://www.eporner.com/hd-porn/95008/Infamous-Tiffany-Teen-Strip-Tease-Video/',
|
||||||
'md5': '39d486f046212d8e1b911c52ab4691f8',
|
'md5': '39d486f046212d8e1b911c52ab4691f8',
|
||||||
@ -35,6 +35,9 @@ class EpornerIE(InfoExtractor):
|
|||||||
}, {
|
}, {
|
||||||
'url': 'http://www.eporner.com/hd-porn/3YRUtzMcWn0',
|
'url': 'http://www.eporner.com/hd-porn/3YRUtzMcWn0',
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://www.eporner.com/hd-porn/3YRUtzMcWn0',
|
||||||
|
'only_matching': True,
|
||||||
}]
|
}]
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
|
@ -31,10 +31,7 @@ from .aenetworks import (
|
|||||||
AENetworksIE,
|
AENetworksIE,
|
||||||
HistoryTopicIE,
|
HistoryTopicIE,
|
||||||
)
|
)
|
||||||
from .afreecatv import (
|
from .afreecatv import AfreecaTVIE
|
||||||
AfreecaTVIE,
|
|
||||||
AfreecaTVGlobalIE,
|
|
||||||
)
|
|
||||||
from .airmozilla import AirMozillaIE
|
from .airmozilla import AirMozillaIE
|
||||||
from .aljazeera import AlJazeeraIE
|
from .aljazeera import AlJazeeraIE
|
||||||
from .alphaporno import AlphaPornoIE
|
from .alphaporno import AlphaPornoIE
|
||||||
@ -153,6 +150,7 @@ from .canalc2 import Canalc2IE
|
|||||||
from .canvas import (
|
from .canvas import (
|
||||||
CanvasIE,
|
CanvasIE,
|
||||||
CanvasEenIE,
|
CanvasEenIE,
|
||||||
|
VrtNUIE,
|
||||||
)
|
)
|
||||||
from .carambatv import (
|
from .carambatv import (
|
||||||
CarambaTVIE,
|
CarambaTVIE,
|
||||||
@ -384,6 +382,7 @@ from .freesound import FreesoundIE
|
|||||||
from .freespeech import FreespeechIE
|
from .freespeech import FreespeechIE
|
||||||
from .freshlive import FreshLiveIE
|
from .freshlive import FreshLiveIE
|
||||||
from .funimation import FunimationIE
|
from .funimation import FunimationIE
|
||||||
|
from .funk import FunkIE
|
||||||
from .funnyordie import FunnyOrDieIE
|
from .funnyordie import FunnyOrDieIE
|
||||||
from .fusion import FusionIE
|
from .fusion import FusionIE
|
||||||
from .fxnetworks import FXNetworksIE
|
from .fxnetworks import FXNetworksIE
|
||||||
@ -625,7 +624,6 @@ from .mwave import MwaveIE, MwaveMeetGreetIE
|
|||||||
from .myspace import MySpaceIE, MySpaceAlbumIE
|
from .myspace import MySpaceIE, MySpaceAlbumIE
|
||||||
from .myspass import MySpassIE
|
from .myspass import MySpassIE
|
||||||
from .myvi import MyviIE
|
from .myvi import MyviIE
|
||||||
from .myvideo import MyVideoIE
|
|
||||||
from .myvidster import MyVidsterIE
|
from .myvidster import MyVidsterIE
|
||||||
from .nationalgeographic import (
|
from .nationalgeographic import (
|
||||||
NationalGeographicVideoIE,
|
NationalGeographicVideoIE,
|
||||||
@ -927,6 +925,7 @@ from .seeker import SeekerIE
|
|||||||
from .senateisvp import SenateISVPIE
|
from .senateisvp import SenateISVPIE
|
||||||
from .sendtonews import SendtoNewsIE
|
from .sendtonews import SendtoNewsIE
|
||||||
from .servingsys import ServingSysIE
|
from .servingsys import ServingSysIE
|
||||||
|
from .servus import ServusIE
|
||||||
from .sexu import SexuIE
|
from .sexu import SexuIE
|
||||||
from .shahid import ShahidIE
|
from .shahid import ShahidIE
|
||||||
from .shared import (
|
from .shared import (
|
||||||
@ -943,6 +942,7 @@ from .skynewsarabia import (
|
|||||||
)
|
)
|
||||||
from .skysports import SkySportsIE
|
from .skysports import SkySportsIE
|
||||||
from .slideshare import SlideshareIE
|
from .slideshare import SlideshareIE
|
||||||
|
from .slideslive import SlidesLiveIE
|
||||||
from .slutload import SlutloadIE
|
from .slutload import SlutloadIE
|
||||||
from .smotri import (
|
from .smotri import (
|
||||||
SmotriIE,
|
SmotriIE,
|
||||||
@ -1139,6 +1139,7 @@ from .udn import UDNEmbedIE
|
|||||||
from .uktvplay import UKTVPlayIE
|
from .uktvplay import UKTVPlayIE
|
||||||
from .digiteka import DigitekaIE
|
from .digiteka import DigitekaIE
|
||||||
from .unistra import UnistraIE
|
from .unistra import UnistraIE
|
||||||
|
from .unity import UnityIE
|
||||||
from .uol import UOLIE
|
from .uol import UOLIE
|
||||||
from .uplynk import (
|
from .uplynk import (
|
||||||
UplynkIE,
|
UplynkIE,
|
||||||
@ -1246,7 +1247,10 @@ from .vodpl import VODPlIE
|
|||||||
from .vodplatform import VODPlatformIE
|
from .vodplatform import VODPlatformIE
|
||||||
from .voicerepublic import VoiceRepublicIE
|
from .voicerepublic import VoiceRepublicIE
|
||||||
from .voot import VootIE
|
from .voot import VootIE
|
||||||
from .voxmedia import VoxMediaIE
|
from .voxmedia import (
|
||||||
|
VoxMediaVolumeIE,
|
||||||
|
VoxMediaIE,
|
||||||
|
)
|
||||||
from .vporn import VpornIE
|
from .vporn import VpornIE
|
||||||
from .vrt import VRTIE
|
from .vrt import VRTIE
|
||||||
from .vrak import VrakIE
|
from .vrak import VrakIE
|
||||||
@ -1345,7 +1349,6 @@ from .youtube import (
|
|||||||
YoutubeSearchDateIE,
|
YoutubeSearchDateIE,
|
||||||
YoutubeSearchIE,
|
YoutubeSearchIE,
|
||||||
YoutubeSearchURLIE,
|
YoutubeSearchURLIE,
|
||||||
YoutubeSharedVideoIE,
|
|
||||||
YoutubeShowIE,
|
YoutubeShowIE,
|
||||||
YoutubeSubscriptionsIE,
|
YoutubeSubscriptionsIE,
|
||||||
YoutubeTruncatedIDIE,
|
YoutubeTruncatedIDIE,
|
||||||
|
@ -67,9 +67,9 @@ class FacebookIE(InfoExtractor):
|
|||||||
'uploader': 'Tennis on Facebook',
|
'uploader': 'Tennis on Facebook',
|
||||||
'upload_date': '20140908',
|
'upload_date': '20140908',
|
||||||
'timestamp': 1410199200,
|
'timestamp': 1410199200,
|
||||||
}
|
},
|
||||||
|
'skip': 'Requires logging in',
|
||||||
}, {
|
}, {
|
||||||
'note': 'Video without discernible title',
|
|
||||||
'url': 'https://www.facebook.com/video.php?v=274175099429670',
|
'url': 'https://www.facebook.com/video.php?v=274175099429670',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '274175099429670',
|
'id': '274175099429670',
|
||||||
@ -78,6 +78,7 @@ class FacebookIE(InfoExtractor):
|
|||||||
'uploader': 'Asif Nawab Butt',
|
'uploader': 'Asif Nawab Butt',
|
||||||
'upload_date': '20140506',
|
'upload_date': '20140506',
|
||||||
'timestamp': 1399398998,
|
'timestamp': 1399398998,
|
||||||
|
'thumbnail': r're:^https?://.*',
|
||||||
},
|
},
|
||||||
'expected_warnings': [
|
'expected_warnings': [
|
||||||
'title'
|
'title'
|
||||||
@ -94,6 +95,7 @@ class FacebookIE(InfoExtractor):
|
|||||||
'upload_date': '20160110',
|
'upload_date': '20160110',
|
||||||
'timestamp': 1452431627,
|
'timestamp': 1452431627,
|
||||||
},
|
},
|
||||||
|
'skip': 'Requires logging in',
|
||||||
}, {
|
}, {
|
||||||
'url': 'https://www.facebook.com/maxlayn/posts/10153807558977570',
|
'url': 'https://www.facebook.com/maxlayn/posts/10153807558977570',
|
||||||
'md5': '037b1fa7f3c2d02b7a0d7bc16031ecc6',
|
'md5': '037b1fa7f3c2d02b7a0d7bc16031ecc6',
|
||||||
@ -121,7 +123,11 @@ class FacebookIE(InfoExtractor):
|
|||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '10153664894881749',
|
'id': '10153664894881749',
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
'title': 'Facebook video #10153664894881749',
|
'title': 'Average time to confirm recent Supreme Court nominees: 67 days Longest it\'s t...',
|
||||||
|
'thumbnail': r're:^https?://.*',
|
||||||
|
'timestamp': 1456259628,
|
||||||
|
'upload_date': '20160223',
|
||||||
|
'uploader': 'Barack Obama',
|
||||||
},
|
},
|
||||||
}, {
|
}, {
|
||||||
# have 1080P, but only up to 720p in swf params
|
# have 1080P, but only up to 720p in swf params
|
||||||
@ -130,10 +136,11 @@ class FacebookIE(InfoExtractor):
|
|||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '10155529876156509',
|
'id': '10155529876156509',
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
'title': 'Holocaust survivor becomes US citizen',
|
'title': 'She survived the holocaust — and years later, she’s getting her citizenship s...',
|
||||||
'timestamp': 1477818095,
|
'timestamp': 1477818095,
|
||||||
'upload_date': '20161030',
|
'upload_date': '20161030',
|
||||||
'uploader': 'CNN',
|
'uploader': 'CNN',
|
||||||
|
'thumbnail': r're:^https?://.*',
|
||||||
},
|
},
|
||||||
}, {
|
}, {
|
||||||
# bigPipe.onPageletArrive ... onPageletArrive pagelet_group_mall
|
# bigPipe.onPageletArrive ... onPageletArrive pagelet_group_mall
|
||||||
@ -158,6 +165,7 @@ class FacebookIE(InfoExtractor):
|
|||||||
'timestamp': 1477305000,
|
'timestamp': 1477305000,
|
||||||
'upload_date': '20161024',
|
'upload_date': '20161024',
|
||||||
'uploader': 'La Guía Del Varón',
|
'uploader': 'La Guía Del Varón',
|
||||||
|
'thumbnail': r're:^https?://.*',
|
||||||
},
|
},
|
||||||
'params': {
|
'params': {
|
||||||
'skip_download': True,
|
'skip_download': True,
|
||||||
@ -376,6 +384,7 @@ class FacebookIE(InfoExtractor):
|
|||||||
timestamp = int_or_none(self._search_regex(
|
timestamp = int_or_none(self._search_regex(
|
||||||
r'<abbr[^>]+data-utime=["\'](\d+)', webpage,
|
r'<abbr[^>]+data-utime=["\'](\d+)', webpage,
|
||||||
'timestamp', default=None))
|
'timestamp', default=None))
|
||||||
|
thumbnail = self._og_search_thumbnail(webpage)
|
||||||
|
|
||||||
info_dict = {
|
info_dict = {
|
||||||
'id': video_id,
|
'id': video_id,
|
||||||
@ -383,6 +392,7 @@ class FacebookIE(InfoExtractor):
|
|||||||
'formats': formats,
|
'formats': formats,
|
||||||
'uploader': uploader,
|
'uploader': uploader,
|
||||||
'timestamp': timestamp,
|
'timestamp': timestamp,
|
||||||
|
'thumbnail': thumbnail,
|
||||||
}
|
}
|
||||||
|
|
||||||
return webpage, info_dict
|
return webpage, info_dict
|
||||||
|
@ -2,7 +2,10 @@
|
|||||||
from __future__ import unicode_literals
|
from __future__ import unicode_literals
|
||||||
|
|
||||||
from .adobepass import AdobePassIE
|
from .adobepass import AdobePassIE
|
||||||
|
from .uplynk import UplynkPreplayIE
|
||||||
|
from ..compat import compat_str
|
||||||
from ..utils import (
|
from ..utils import (
|
||||||
|
HEADRequest,
|
||||||
int_or_none,
|
int_or_none,
|
||||||
parse_age_limit,
|
parse_age_limit,
|
||||||
parse_duration,
|
parse_duration,
|
||||||
@ -53,14 +56,7 @@ class FOXIE(AdobePassIE):
|
|||||||
})
|
})
|
||||||
|
|
||||||
title = video['name']
|
title = video['name']
|
||||||
|
release_url = video['videoRelease']['url']
|
||||||
m3u8_url = self._download_json(
|
|
||||||
video['videoRelease']['url'], video_id)['playURL']
|
|
||||||
|
|
||||||
formats = self._extract_m3u8_formats(
|
|
||||||
m3u8_url, video_id, 'mp4',
|
|
||||||
entry_protocol='m3u8_native', m3u8_id='hls')
|
|
||||||
self._sort_formats(formats)
|
|
||||||
|
|
||||||
description = video.get('description')
|
description = video.get('description')
|
||||||
duration = int_or_none(video.get('durationInSeconds')) or int_or_none(
|
duration = int_or_none(video.get('durationInSeconds')) or int_or_none(
|
||||||
@ -84,7 +80,7 @@ class FOXIE(AdobePassIE):
|
|||||||
# TODO: AP
|
# TODO: AP
|
||||||
pass
|
pass
|
||||||
|
|
||||||
return {
|
info = {
|
||||||
'id': video_id,
|
'id': video_id,
|
||||||
'title': title,
|
'title': title,
|
||||||
'description': description,
|
'description': description,
|
||||||
@ -97,5 +93,22 @@ class FOXIE(AdobePassIE):
|
|||||||
'episode': episode,
|
'episode': episode,
|
||||||
'episode_number': episode_number,
|
'episode_number': episode_number,
|
||||||
'release_year': release_year,
|
'release_year': release_year,
|
||||||
'formats': formats,
|
|
||||||
}
|
}
|
||||||
|
|
||||||
|
urlh = self._request_webpage(HEADRequest(release_url), video_id)
|
||||||
|
video_url = compat_str(urlh.geturl())
|
||||||
|
|
||||||
|
if UplynkPreplayIE.suitable(video_url):
|
||||||
|
info.update({
|
||||||
|
'_type': 'url_transparent',
|
||||||
|
'url': video_url,
|
||||||
|
'ie_key': UplynkPreplayIE.ie_key(),
|
||||||
|
})
|
||||||
|
else:
|
||||||
|
m3u8_url = self._download_json(release_url, video_id)['playURL']
|
||||||
|
formats = self._extract_m3u8_formats(
|
||||||
|
m3u8_url, video_id, 'mp4',
|
||||||
|
entry_protocol='m3u8_native', m3u8_id='hls')
|
||||||
|
self._sort_formats(formats)
|
||||||
|
info['formats'] = formats
|
||||||
|
return info
|
||||||
|
@ -27,7 +27,7 @@ class FreespeechIE(InfoExtractor):
|
|||||||
mobj = re.match(self._VALID_URL, url)
|
mobj = re.match(self._VALID_URL, url)
|
||||||
title = mobj.group('title')
|
title = mobj.group('title')
|
||||||
webpage = self._download_webpage(url, title)
|
webpage = self._download_webpage(url, title)
|
||||||
info_json = self._search_regex(r'jQuery.extend\(Drupal.settings, ({.*?})\);', webpage, 'info')
|
info_json = self._search_regex(r'jQuery\.extend\(Drupal\.settings, ({.*?})\);', webpage, 'info')
|
||||||
info = json.loads(info_json)
|
info = json.loads(info_json)
|
||||||
|
|
||||||
return {
|
return {
|
||||||
|
43
youtube_dl/extractor/funk.py
Normal file
43
youtube_dl/extractor/funk.py
Normal file
@ -0,0 +1,43 @@
|
|||||||
|
# coding: utf-8
|
||||||
|
from __future__ import unicode_literals
|
||||||
|
|
||||||
|
from .common import InfoExtractor
|
||||||
|
from .nexx import NexxIE
|
||||||
|
from ..utils import extract_attributes
|
||||||
|
|
||||||
|
|
||||||
|
class FunkIE(InfoExtractor):
|
||||||
|
_VALID_URL = r'https?://(?:www\.)?funk\.net/(?:mix|channel)/(?:[^/]+/)*(?P<id>[^?/#]+)'
|
||||||
|
_TESTS = [{
|
||||||
|
'url': 'https://www.funk.net/mix/59d65d935f8b160001828b5b/0/59d517e741dca10001252574/',
|
||||||
|
'md5': '4d40974481fa3475f8bccfd20c5361f8',
|
||||||
|
'info_dict': {
|
||||||
|
'id': '716599',
|
||||||
|
'ext': 'mp4',
|
||||||
|
'title': 'Neue Rechte Welle',
|
||||||
|
'description': 'md5:a30a53f740ffb6bfd535314c2cc5fb69',
|
||||||
|
'timestamp': 1501337639,
|
||||||
|
'upload_date': '20170729',
|
||||||
|
},
|
||||||
|
'params': {
|
||||||
|
'format': 'bestvideo',
|
||||||
|
'skip_download': True,
|
||||||
|
},
|
||||||
|
}, {
|
||||||
|
'url': 'https://www.funk.net/channel/59d5149841dca100012511e3/0/59d52049999264000182e79d/',
|
||||||
|
'only_matching': True,
|
||||||
|
}]
|
||||||
|
|
||||||
|
def _real_extract(self, url):
|
||||||
|
video_id = self._match_id(url)
|
||||||
|
|
||||||
|
webpage = self._download_webpage(url, video_id)
|
||||||
|
|
||||||
|
domain_id = NexxIE._extract_domain_id(webpage) or '741'
|
||||||
|
nexx_id = extract_attributes(self._search_regex(
|
||||||
|
r'(<div[^>]id=["\']mediaplayer-funk[^>]+>)',
|
||||||
|
webpage, 'media player'))['data-id']
|
||||||
|
|
||||||
|
return self.url_result(
|
||||||
|
'nexx:%s:%s' % (domain_id, nexx_id), ie=NexxIE.ie_key(),
|
||||||
|
video_id=nexx_id)
|
@ -105,7 +105,7 @@ class GameSpotIE(OnceIE):
|
|||||||
onceux_url = self._parse_json(unescapeHTML(onceux_json), page_id).get('metadataUri')
|
onceux_url = self._parse_json(unescapeHTML(onceux_json), page_id).get('metadataUri')
|
||||||
if onceux_url:
|
if onceux_url:
|
||||||
formats.extend(self._extract_once_formats(re.sub(
|
formats.extend(self._extract_once_formats(re.sub(
|
||||||
r'https?://[^/]+', 'http://once.unicornmedia.com', onceux_url).replace('ads/vmap/', '')))
|
r'https?://[^/]+', 'http://once.unicornmedia.com', onceux_url)))
|
||||||
|
|
||||||
if not formats:
|
if not formats:
|
||||||
for quality in ['sd', 'hd']:
|
for quality in ['sd', 'hd']:
|
||||||
|
@ -101,6 +101,7 @@ from .mediaset import MediasetIE
|
|||||||
from .joj import JojIE
|
from .joj import JojIE
|
||||||
from .megaphone import MegaphoneIE
|
from .megaphone import MegaphoneIE
|
||||||
from .vzaar import VzaarIE
|
from .vzaar import VzaarIE
|
||||||
|
from .channel9 import Channel9IE
|
||||||
|
|
||||||
|
|
||||||
class GenericIE(InfoExtractor):
|
class GenericIE(InfoExtractor):
|
||||||
@ -1090,7 +1091,7 @@ class GenericIE(InfoExtractor):
|
|||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
'upload_date': '20150212',
|
'upload_date': '20150212',
|
||||||
'uploader': 'The National Archives UK',
|
'uploader': 'The National Archives UK',
|
||||||
'description': 'md5:a236581cd2449dd2df4f93412f3f01c6',
|
'description': 'md5:8078af856dca76edc42910b61273dbbf',
|
||||||
'uploader_id': 'NationalArchives08',
|
'uploader_id': 'NationalArchives08',
|
||||||
'title': 'Webinar: Using Discovery, The National Archives’ online catalogue',
|
'title': 'Webinar: Using Discovery, The National Archives’ online catalogue',
|
||||||
},
|
},
|
||||||
@ -1106,7 +1107,8 @@ class GenericIE(InfoExtractor):
|
|||||||
},
|
},
|
||||||
'params': {
|
'params': {
|
||||||
'skip_download': True,
|
'skip_download': True,
|
||||||
}
|
},
|
||||||
|
'skip': 'does not contain a video anymore',
|
||||||
},
|
},
|
||||||
# Complex jwplayer
|
# Complex jwplayer
|
||||||
{
|
{
|
||||||
@ -1115,6 +1117,7 @@ class GenericIE(InfoExtractor):
|
|||||||
'id': 'videos',
|
'id': 'videos',
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
'title': 'king machine trailer 1',
|
'title': 'king machine trailer 1',
|
||||||
|
'description': 'Browse King Machine videos & audio for sweet media. Your eyes will thank you.',
|
||||||
'thumbnail': r're:^https?://.*\.jpg$',
|
'thumbnail': r're:^https?://.*\.jpg$',
|
||||||
},
|
},
|
||||||
},
|
},
|
||||||
@ -1167,7 +1170,7 @@ class GenericIE(InfoExtractor):
|
|||||||
'playlist_mincount': 5,
|
'playlist_mincount': 5,
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': 'aanslagen-kopenhagen',
|
'id': 'aanslagen-kopenhagen',
|
||||||
'title': 'Aanslagen Kopenhagen | RTL Nieuws',
|
'title': 'Aanslagen Kopenhagen',
|
||||||
}
|
}
|
||||||
},
|
},
|
||||||
# Zapiks embed
|
# Zapiks embed
|
||||||
@ -1299,6 +1302,7 @@ class GenericIE(InfoExtractor):
|
|||||||
'params': {
|
'params': {
|
||||||
'skip_download': True,
|
'skip_download': True,
|
||||||
},
|
},
|
||||||
|
'skip': 'This video is unavailable.',
|
||||||
},
|
},
|
||||||
# Pladform embed
|
# Pladform embed
|
||||||
{
|
{
|
||||||
@ -1312,6 +1316,7 @@ class GenericIE(InfoExtractor):
|
|||||||
'duration': 694,
|
'duration': 694,
|
||||||
'age_limit': 0,
|
'age_limit': 0,
|
||||||
},
|
},
|
||||||
|
'skip': 'HTTP Error 404: Not Found',
|
||||||
},
|
},
|
||||||
# Playwire embed
|
# Playwire embed
|
||||||
{
|
{
|
||||||
@ -1332,6 +1337,14 @@ class GenericIE(InfoExtractor):
|
|||||||
'id': '518726732',
|
'id': '518726732',
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
'title': 'Facebook Creates "On This Day" | Crunch Report',
|
'title': 'Facebook Creates "On This Day" | Crunch Report',
|
||||||
|
'description': 'Amazon updates Fire TV line, Tesla\'s Model X spotted in the wild',
|
||||||
|
'timestamp': 1427237531,
|
||||||
|
'uploader': 'Crunch Report',
|
||||||
|
'upload_date': '20150324',
|
||||||
|
},
|
||||||
|
'params': {
|
||||||
|
# m3u8 download
|
||||||
|
'skip_download': True,
|
||||||
},
|
},
|
||||||
},
|
},
|
||||||
# SVT embed
|
# SVT embed
|
||||||
@ -1383,16 +1396,20 @@ class GenericIE(InfoExtractor):
|
|||||||
'upload_date': '20140107',
|
'upload_date': '20140107',
|
||||||
'timestamp': 1389118457,
|
'timestamp': 1389118457,
|
||||||
},
|
},
|
||||||
|
'skip': 'Invalid Page URL',
|
||||||
},
|
},
|
||||||
# NBC News embed
|
# NBC News embed
|
||||||
{
|
{
|
||||||
'url': 'http://www.vulture.com/2016/06/letterman-couldnt-care-less-about-late-night.html',
|
'url': 'http://www.vulture.com/2016/06/letterman-couldnt-care-less-about-late-night.html',
|
||||||
'md5': '1aa589c675898ae6d37a17913cf68d66',
|
'md5': '1aa589c675898ae6d37a17913cf68d66',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '701714499682',
|
'id': 'x_dtl_oa_LettermanliftPR_160608',
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
'title': 'PREVIEW: On Assignment: David Letterman',
|
'title': 'David Letterman: A Preview',
|
||||||
'description': 'A preview of Tom Brokaw\'s interview with David Letterman as part of the On Assignment series powered by Dateline. Airs Sunday June 12 at 7/6c.',
|
'description': 'A preview of Tom Brokaw\'s interview with David Letterman as part of the On Assignment series powered by Dateline. Airs Sunday June 12 at 7/6c.',
|
||||||
|
'upload_date': '20160609',
|
||||||
|
'timestamp': 1465431544,
|
||||||
|
'uploader': 'NBCU-NEWS',
|
||||||
},
|
},
|
||||||
},
|
},
|
||||||
# UDN embed
|
# UDN embed
|
||||||
@ -1409,6 +1426,7 @@ class GenericIE(InfoExtractor):
|
|||||||
# m3u8 download
|
# m3u8 download
|
||||||
'skip_download': True,
|
'skip_download': True,
|
||||||
},
|
},
|
||||||
|
'expected_warnings': ['Failed to parse JSON Expecting value'],
|
||||||
},
|
},
|
||||||
# Ooyala embed
|
# Ooyala embed
|
||||||
{
|
{
|
||||||
@ -1416,7 +1434,7 @@ class GenericIE(InfoExtractor):
|
|||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '50YnY4czr4ms1vJ7yz3xzq0excz_pUMs',
|
'id': '50YnY4czr4ms1vJ7yz3xzq0excz_pUMs',
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
'description': 'VIDEO: INDEX/MATCH versus VLOOKUP.',
|
'description': 'Index/Match versus VLOOKUP.',
|
||||||
'title': 'This is what separates the Excel masters from the wannabes',
|
'title': 'This is what separates the Excel masters from the wannabes',
|
||||||
'duration': 191.933,
|
'duration': 191.933,
|
||||||
},
|
},
|
||||||
@ -1454,7 +1472,8 @@ class GenericIE(InfoExtractor):
|
|||||||
'upload_date': '20150622',
|
'upload_date': '20150622',
|
||||||
'uploader': 'Public Sénat',
|
'uploader': 'Public Sénat',
|
||||||
'uploader_id': 'xa9gza',
|
'uploader_id': 'xa9gza',
|
||||||
}
|
},
|
||||||
|
'skip': 'File not found.',
|
||||||
},
|
},
|
||||||
# OnionStudios embed
|
# OnionStudios embed
|
||||||
{
|
{
|
||||||
@ -1612,22 +1631,6 @@ class GenericIE(InfoExtractor):
|
|||||||
},
|
},
|
||||||
'add_ie': ['BrightcoveLegacy'],
|
'add_ie': ['BrightcoveLegacy'],
|
||||||
},
|
},
|
||||||
# Nexx embed
|
|
||||||
{
|
|
||||||
'url': 'https://www.funk.net/serien/5940e15073f6120001657956/items/593efbb173f6120001657503',
|
|
||||||
'info_dict': {
|
|
||||||
'id': '247746',
|
|
||||||
'ext': 'mp4',
|
|
||||||
'title': "Yesterday's Jam (OV)",
|
|
||||||
'description': 'md5:09bc0984723fed34e2581624a84e05f0',
|
|
||||||
'timestamp': 1492594816,
|
|
||||||
'upload_date': '20170419',
|
|
||||||
},
|
|
||||||
'params': {
|
|
||||||
'format': 'bestvideo',
|
|
||||||
'skip_download': True,
|
|
||||||
},
|
|
||||||
},
|
|
||||||
# Facebook <iframe> embed
|
# Facebook <iframe> embed
|
||||||
{
|
{
|
||||||
'url': 'https://www.hostblogger.de/blog/archives/6181-Auto-jagt-Betonmischer.html',
|
'url': 'https://www.hostblogger.de/blog/archives/6181-Auto-jagt-Betonmischer.html',
|
||||||
@ -2206,7 +2209,7 @@ class GenericIE(InfoExtractor):
|
|||||||
# And then there are the jokers who advertise that they use RTA,
|
# And then there are the jokers who advertise that they use RTA,
|
||||||
# but actually don't.
|
# but actually don't.
|
||||||
AGE_LIMIT_MARKERS = [
|
AGE_LIMIT_MARKERS = [
|
||||||
r'Proudly Labeled <a href="http://www.rtalabel.org/" title="Restricted to Adults">RTA</a>',
|
r'Proudly Labeled <a href="http://www\.rtalabel\.org/" title="Restricted to Adults">RTA</a>',
|
||||||
]
|
]
|
||||||
if any(re.search(marker, webpage) for marker in AGE_LIMIT_MARKERS):
|
if any(re.search(marker, webpage) for marker in AGE_LIMIT_MARKERS):
|
||||||
age_limit = 18
|
age_limit = 18
|
||||||
@ -2268,7 +2271,7 @@ class GenericIE(InfoExtractor):
|
|||||||
|
|
||||||
# Look for embedded rtl.nl player
|
# Look for embedded rtl.nl player
|
||||||
matches = re.findall(
|
matches = re.findall(
|
||||||
r'<iframe[^>]+?src="((?:https?:)?//(?:www\.)?rtl\.nl/system/videoplayer/[^"]+(?:video_)?embed[^"]+)"',
|
r'<iframe[^>]+?src="((?:https?:)?//(?:(?:www|static)\.)?rtl\.nl/(?:system/videoplayer/[^"]+(?:video_)?)?embed[^"]+)"',
|
||||||
webpage)
|
webpage)
|
||||||
if matches:
|
if matches:
|
||||||
return self.playlist_from_matches(matches, video_id, video_title, ie='RtlNl')
|
return self.playlist_from_matches(matches, video_id, video_title, ie='RtlNl')
|
||||||
@ -2667,7 +2670,7 @@ class GenericIE(InfoExtractor):
|
|||||||
|
|
||||||
# Look for UDN embeds
|
# Look for UDN embeds
|
||||||
mobj = re.search(
|
mobj = re.search(
|
||||||
r'<iframe[^>]+src="(?P<url>%s)"' % UDNEmbedIE._PROTOCOL_RELATIVE_VALID_URL, webpage)
|
r'<iframe[^>]+src="(?:https?:)?(?P<url>%s)"' % UDNEmbedIE._PROTOCOL_RELATIVE_VALID_URL, webpage)
|
||||||
if mobj is not None:
|
if mobj is not None:
|
||||||
return self.url_result(
|
return self.url_result(
|
||||||
compat_urlparse.urljoin(url, mobj.group('url')), 'UDNEmbed')
|
compat_urlparse.urljoin(url, mobj.group('url')), 'UDNEmbed')
|
||||||
@ -2871,6 +2874,11 @@ class GenericIE(InfoExtractor):
|
|||||||
return self.playlist_from_matches(
|
return self.playlist_from_matches(
|
||||||
vzaar_urls, video_id, video_title, ie=VzaarIE.ie_key())
|
vzaar_urls, video_id, video_title, ie=VzaarIE.ie_key())
|
||||||
|
|
||||||
|
channel9_urls = Channel9IE._extract_urls(webpage)
|
||||||
|
if channel9_urls:
|
||||||
|
return self.playlist_from_matches(
|
||||||
|
channel9_urls, video_id, video_title, ie=Channel9IE.ie_key())
|
||||||
|
|
||||||
def merge_dicts(dict1, dict2):
|
def merge_dicts(dict1, dict2):
|
||||||
merged = {}
|
merged = {}
|
||||||
for k, v in dict1.items():
|
for k, v in dict1.items():
|
||||||
|
22
youtube_dl/extractor/gigya.py
Normal file
22
youtube_dl/extractor/gigya.py
Normal file
@ -0,0 +1,22 @@
|
|||||||
|
from __future__ import unicode_literals
|
||||||
|
|
||||||
|
from .common import InfoExtractor
|
||||||
|
|
||||||
|
from ..utils import (
|
||||||
|
ExtractorError,
|
||||||
|
urlencode_postdata,
|
||||||
|
)
|
||||||
|
|
||||||
|
|
||||||
|
class GigyaBaseIE(InfoExtractor):
|
||||||
|
def _gigya_login(self, auth_data):
|
||||||
|
auth_info = self._download_json(
|
||||||
|
'https://accounts.eu1.gigya.com/accounts.login', None,
|
||||||
|
note='Logging in', errnote='Unable to log in',
|
||||||
|
data=urlencode_postdata(auth_data))
|
||||||
|
|
||||||
|
error_message = auth_info.get('errorDetails') or auth_info.get('errorMessage')
|
||||||
|
if error_message:
|
||||||
|
raise ExtractorError(
|
||||||
|
'Unable to login: %s' % error_message, expected=True)
|
||||||
|
return auth_info
|
@ -61,7 +61,7 @@ class GooglePlusIE(InfoExtractor):
|
|||||||
'width': int(width),
|
'width': int(width),
|
||||||
'height': int(height),
|
'height': int(height),
|
||||||
} for width, height, video_url in re.findall(
|
} for width, height, video_url in re.findall(
|
||||||
r'\d+,(\d+),(\d+),"(https?://[^.]+\.googleusercontent.com.*?)"', webpage)]
|
r'\d+,(\d+),(\d+),"(https?://[^.]+\.googleusercontent\.com.*?)"', webpage)]
|
||||||
self._sort_formats(formats)
|
self._sort_formats(formats)
|
||||||
|
|
||||||
return {
|
return {
|
||||||
|
@ -11,45 +11,20 @@ from ..utils import (
|
|||||||
|
|
||||||
|
|
||||||
class HowStuffWorksIE(InfoExtractor):
|
class HowStuffWorksIE(InfoExtractor):
|
||||||
_VALID_URL = r'https?://[\da-z-]+\.howstuffworks\.com/(?:[^/]+/)*(?:\d+-)?(?P<id>.+?)-video\.htm'
|
_VALID_URL = r'https?://[\da-z-]+\.(?:howstuffworks|stuff(?:(?:youshould|theydontwantyouto)know|toblowyourmind|momnevertoldyou)|(?:brain|car)stuffshow|fwthinking|geniusstuff)\.com/(?:[^/]+/)*(?:\d+-)?(?P<id>.+?)-video\.htm'
|
||||||
_TESTS = [
|
_TESTS = [
|
||||||
{
|
{
|
||||||
'url': 'http://adventure.howstuffworks.com/5266-cool-jobs-iditarod-musher-video.htm',
|
'url': 'http://www.stufftoblowyourmind.com/videos/optical-illusions-video.htm',
|
||||||
|
'md5': '76646a5acc0c92bf7cd66751ca5db94d',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '450221',
|
'id': '855410',
|
||||||
'ext': 'flv',
|
|
||||||
'title': 'Cool Jobs - Iditarod Musher',
|
|
||||||
'description': 'Cold sleds, freezing temps and warm dog breath... an Iditarod musher\'s dream. Kasey-Dee Gardner jumps on a sled to find out what the big deal is.',
|
|
||||||
'display_id': 'cool-jobs-iditarod-musher',
|
|
||||||
'thumbnail': r're:^https?://.*\.jpg$',
|
|
||||||
'duration': 161,
|
|
||||||
},
|
|
||||||
'skip': 'Video broken',
|
|
||||||
},
|
|
||||||
{
|
|
||||||
'url': 'http://adventure.howstuffworks.com/7199-survival-zone-food-and-water-in-the-savanna-video.htm',
|
|
||||||
'info_dict': {
|
|
||||||
'id': '453464',
|
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
'title': 'Survival Zone: Food and Water In the Savanna',
|
'title': 'Your Trickster Brain: Optical Illusions -- Science on the Web',
|
||||||
'description': 'Learn how to find both food and water while trekking in the African savannah. In this video from the Discovery Channel.',
|
'description': 'md5:e374ff9561f6833ad076a8cc0a5ab2fb',
|
||||||
'display_id': 'survival-zone-food-and-water-in-the-savanna',
|
|
||||||
'thumbnail': r're:^https?://.*\.jpg$',
|
|
||||||
},
|
},
|
||||||
},
|
},
|
||||||
{
|
{
|
||||||
'url': 'http://entertainment.howstuffworks.com/arts/2706-sword-swallowing-1-by-dan-meyer-video.htm',
|
'url': 'http://shows.howstuffworks.com/more-shows/why-does-balloon-stick-to-hair-video.htm',
|
||||||
'info_dict': {
|
|
||||||
'id': '440011',
|
|
||||||
'ext': 'mp4',
|
|
||||||
'title': 'Sword Swallowing #1 by Dan Meyer',
|
|
||||||
'description': 'Video footage (1 of 3) used by permission of the owner Dan Meyer through Sword Swallowers Association International <www.swordswallow.org>',
|
|
||||||
'display_id': 'sword-swallowing-1-by-dan-meyer',
|
|
||||||
'thumbnail': r're:^https?://.*\.jpg$',
|
|
||||||
},
|
|
||||||
},
|
|
||||||
{
|
|
||||||
'url': 'http://shows.howstuffworks.com/stuff-to-blow-your-mind/optical-illusions-video.htm',
|
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
}
|
}
|
||||||
]
|
]
|
||||||
|
@ -104,7 +104,7 @@ class HRTiIE(HRTiBaseIE):
|
|||||||
(?:
|
(?:
|
||||||
hrti:(?P<short_id>[0-9]+)|
|
hrti:(?P<short_id>[0-9]+)|
|
||||||
https?://
|
https?://
|
||||||
hrti\.hrt\.hr/\#/video/show/(?P<id>[0-9]+)/(?P<display_id>[^/]+)?
|
hrti\.hrt\.hr/(?:\#/)?video/show/(?P<id>[0-9]+)/(?P<display_id>[^/]+)?
|
||||||
)
|
)
|
||||||
'''
|
'''
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
@ -129,6 +129,9 @@ class HRTiIE(HRTiBaseIE):
|
|||||||
}, {
|
}, {
|
||||||
'url': 'hrti:2181385',
|
'url': 'hrti:2181385',
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'https://hrti.hrt.hr/video/show/3873068/cuvar-dvorca-dramska-serija-14',
|
||||||
|
'only_matching': True,
|
||||||
}]
|
}]
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
@ -170,7 +173,7 @@ class HRTiIE(HRTiBaseIE):
|
|||||||
|
|
||||||
|
|
||||||
class HRTiPlaylistIE(HRTiBaseIE):
|
class HRTiPlaylistIE(HRTiBaseIE):
|
||||||
_VALID_URL = r'https?://hrti.hrt.hr/#/video/list/category/(?P<id>[0-9]+)/(?P<display_id>[^/]+)?'
|
_VALID_URL = r'https?://hrti\.hrt\.hr/(?:#/)?video/list/category/(?P<id>[0-9]+)/(?P<display_id>[^/]+)?'
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
'url': 'https://hrti.hrt.hr/#/video/list/category/212/ekumena',
|
'url': 'https://hrti.hrt.hr/#/video/list/category/212/ekumena',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
@ -182,6 +185,9 @@ class HRTiPlaylistIE(HRTiBaseIE):
|
|||||||
}, {
|
}, {
|
||||||
'url': 'https://hrti.hrt.hr/#/video/list/category/212/',
|
'url': 'https://hrti.hrt.hr/#/video/list/category/212/',
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'https://hrti.hrt.hr/video/list/category/212/ekumena',
|
||||||
|
'only_matching': True,
|
||||||
}]
|
}]
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
|
@ -203,7 +203,7 @@ class PCMagIE(IGNIE):
|
|||||||
_VALID_URL = r'https?://(?:www\.)?pcmag\.com/(?P<type>videos|article2)(/.+)?/(?P<name_or_id>.+)'
|
_VALID_URL = r'https?://(?:www\.)?pcmag\.com/(?P<type>videos|article2)(/.+)?/(?P<name_or_id>.+)'
|
||||||
IE_NAME = 'pcmag'
|
IE_NAME = 'pcmag'
|
||||||
|
|
||||||
_EMBED_RE = r'iframe.setAttribute\("src",\s*__util.objToUrlString\("http://widgets\.ign\.com/video/embed/content.html?[^"]*url=([^"]+)["&]'
|
_EMBED_RE = r'iframe\.setAttribute\("src",\s*__util.objToUrlString\("http://widgets\.ign\.com/video/embed/content\.html?[^"]*url=([^"]+)["&]'
|
||||||
|
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
'url': 'http://www.pcmag.com/videos/2015/01/06/010615-whats-new-now-is-gogo-snooping-on-your-data',
|
'url': 'http://www.pcmag.com/videos/2015/01/06/010615-whats-new-now-is-gogo-snooping-on-your-data',
|
||||||
|
@ -8,7 +8,10 @@ from ..compat import (
|
|||||||
compat_urllib_parse_unquote,
|
compat_urllib_parse_unquote,
|
||||||
compat_urlparse,
|
compat_urlparse,
|
||||||
)
|
)
|
||||||
from ..utils import determine_ext
|
from ..utils import (
|
||||||
|
determine_ext,
|
||||||
|
update_url_query,
|
||||||
|
)
|
||||||
from .bokecc import BokeCCBaseIE
|
from .bokecc import BokeCCBaseIE
|
||||||
|
|
||||||
|
|
||||||
@ -68,21 +71,22 @@ class InfoQIE(BokeCCBaseIE):
|
|||||||
'play_path': playpath,
|
'play_path': playpath,
|
||||||
}]
|
}]
|
||||||
|
|
||||||
def _extract_cookies(self, webpage):
|
def _extract_cf_auth(self, webpage):
|
||||||
policy = self._search_regex(r'InfoQConstants.scp\s*=\s*\'([^\']+)\'', webpage, 'policy')
|
policy = self._search_regex(r'InfoQConstants\.scp\s*=\s*\'([^\']+)\'', webpage, 'policy')
|
||||||
signature = self._search_regex(r'InfoQConstants.scs\s*=\s*\'([^\']+)\'', webpage, 'signature')
|
signature = self._search_regex(r'InfoQConstants\.scs\s*=\s*\'([^\']+)\'', webpage, 'signature')
|
||||||
key_pair_id = self._search_regex(r'InfoQConstants.sck\s*=\s*\'([^\']+)\'', webpage, 'key-pair-id')
|
key_pair_id = self._search_regex(r'InfoQConstants\.sck\s*=\s*\'([^\']+)\'', webpage, 'key-pair-id')
|
||||||
return 'CloudFront-Policy=%s; CloudFront-Signature=%s; CloudFront-Key-Pair-Id=%s' % (
|
return {
|
||||||
policy, signature, key_pair_id)
|
'Policy': policy,
|
||||||
|
'Signature': signature,
|
||||||
|
'Key-Pair-Id': key_pair_id,
|
||||||
|
}
|
||||||
|
|
||||||
def _extract_http_video(self, webpage):
|
def _extract_http_video(self, webpage):
|
||||||
http_video_url = self._search_regex(r'P\.s\s*=\s*\'([^\']+)\'', webpage, 'video URL')
|
http_video_url = self._search_regex(r'P\.s\s*=\s*\'([^\']+)\'', webpage, 'video URL')
|
||||||
|
http_video_url = update_url_query(http_video_url, self._extract_cf_auth(webpage))
|
||||||
return [{
|
return [{
|
||||||
'format_id': 'http_video',
|
'format_id': 'http_video',
|
||||||
'url': http_video_url,
|
'url': http_video_url,
|
||||||
'http_headers': {
|
|
||||||
'Cookie': self._extract_cookies(webpage)
|
|
||||||
},
|
|
||||||
}]
|
}]
|
||||||
|
|
||||||
def _extract_http_audio(self, webpage, video_id):
|
def _extract_http_audio(self, webpage, video_id):
|
||||||
@ -91,22 +95,20 @@ class InfoQIE(BokeCCBaseIE):
|
|||||||
if not http_audio_url:
|
if not http_audio_url:
|
||||||
return []
|
return []
|
||||||
|
|
||||||
cookies_header = {'Cookie': self._extract_cookies(webpage)}
|
|
||||||
|
|
||||||
# base URL is found in the Location header in the response returned by
|
# base URL is found in the Location header in the response returned by
|
||||||
# GET https://www.infoq.com/mp3download.action?filename=... when logged in.
|
# GET https://www.infoq.com/mp3download.action?filename=... when logged in.
|
||||||
http_audio_url = compat_urlparse.urljoin('http://res.infoq.com/downloads/mp3downloads/', http_audio_url)
|
http_audio_url = compat_urlparse.urljoin('http://res.infoq.com/downloads/mp3downloads/', http_audio_url)
|
||||||
|
http_audio_url = update_url_query(http_audio_url, self._extract_cf_auth(webpage))
|
||||||
|
|
||||||
# audio file seem to be missing some times even if there is a download link
|
# audio file seem to be missing some times even if there is a download link
|
||||||
# so probe URL to make sure
|
# so probe URL to make sure
|
||||||
if not self._is_valid_url(http_audio_url, video_id, headers=cookies_header):
|
if not self._is_valid_url(http_audio_url, video_id):
|
||||||
return []
|
return []
|
||||||
|
|
||||||
return [{
|
return [{
|
||||||
'format_id': 'http_audio',
|
'format_id': 'http_audio',
|
||||||
'url': http_audio_url,
|
'url': http_audio_url,
|
||||||
'vcodec': 'none',
|
'vcodec': 'none',
|
||||||
'http_headers': cookies_header,
|
|
||||||
}]
|
}]
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
|
@ -30,7 +30,7 @@ class JeuxVideoIE(InfoExtractor):
|
|||||||
webpage = self._download_webpage(url, title)
|
webpage = self._download_webpage(url, title)
|
||||||
title = self._html_search_meta('name', webpage) or self._og_search_title(webpage)
|
title = self._html_search_meta('name', webpage) or self._og_search_title(webpage)
|
||||||
config_url = self._html_search_regex(
|
config_url = self._html_search_regex(
|
||||||
r'data-src(?:set-video)?="(/contenu/medias/video.php.*?)"',
|
r'data-src(?:set-video)?="(/contenu/medias/video\.php.*?)"',
|
||||||
webpage, 'config URL')
|
webpage, 'config URL')
|
||||||
config_url = 'http://www.jeuxvideo.com' + config_url
|
config_url = 'http://www.jeuxvideo.com' + config_url
|
||||||
|
|
||||||
|
@ -287,6 +287,9 @@ class KalturaIE(InfoExtractor):
|
|||||||
# skip for now.
|
# skip for now.
|
||||||
if f.get('fileExt') == 'chun':
|
if f.get('fileExt') == 'chun':
|
||||||
continue
|
continue
|
||||||
|
# DRM-protected video, cannot be decrypted
|
||||||
|
if f.get('fileExt') == 'wvm':
|
||||||
|
continue
|
||||||
if not f.get('fileExt'):
|
if not f.get('fileExt'):
|
||||||
# QT indicates QuickTime; some videos have broken fileExt
|
# QT indicates QuickTime; some videos have broken fileExt
|
||||||
if f.get('containerFormat') == 'qt':
|
if f.get('containerFormat') == 'qt':
|
||||||
|
@ -338,7 +338,7 @@ class LivestreamOriginalIE(InfoExtractor):
|
|||||||
info = {
|
info = {
|
||||||
'title': self._og_search_title(webpage),
|
'title': self._og_search_title(webpage),
|
||||||
'description': self._og_search_description(webpage),
|
'description': self._og_search_description(webpage),
|
||||||
'thumbnail': self._search_regex(r'channelLogo.src\s*=\s*"([^"]+)"', webpage, 'thumbnail', None),
|
'thumbnail': self._search_regex(r'channelLogo\.src\s*=\s*"([^"]+)"', webpage, 'thumbnail', None),
|
||||||
}
|
}
|
||||||
video_data = self._download_json(stream_url, content_id)
|
video_data = self._download_json(stream_url, content_id)
|
||||||
is_live = video_data.get('isLive')
|
is_live = video_data.get('isLive')
|
||||||
|
@ -11,7 +11,7 @@ from ..utils import (
|
|||||||
|
|
||||||
|
|
||||||
class LnkGoIE(InfoExtractor):
|
class LnkGoIE(InfoExtractor):
|
||||||
_VALID_URL = r'https?://(?:www\.)?lnkgo\.alfa\.lt/visi-video/(?P<show>[^/]+)/ziurek-(?P<id>[A-Za-z0-9-]+)'
|
_VALID_URL = r'https?://(?:www\.)?lnkgo\.(?:alfa\.)?lt/visi-video/(?P<show>[^/]+)/ziurek-(?P<id>[A-Za-z0-9-]+)'
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
'url': 'http://lnkgo.alfa.lt/visi-video/yra-kaip-yra/ziurek-yra-kaip-yra-162',
|
'url': 'http://lnkgo.alfa.lt/visi-video/yra-kaip-yra/ziurek-yra-kaip-yra-162',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
@ -42,6 +42,9 @@ class LnkGoIE(InfoExtractor):
|
|||||||
'params': {
|
'params': {
|
||||||
'skip_download': True, # HLS download
|
'skip_download': True, # HLS download
|
||||||
},
|
},
|
||||||
|
}, {
|
||||||
|
'url': 'http://www.lnkgo.lt/visi-video/aktualai-pratesimas/ziurek-putka-trys-klausimai',
|
||||||
|
'only_matching': True,
|
||||||
}]
|
}]
|
||||||
_AGE_LIMITS = {
|
_AGE_LIMITS = {
|
||||||
'N-7': 7,
|
'N-7': 7,
|
||||||
|
@ -5,7 +5,7 @@ from .common import InfoExtractor
|
|||||||
|
|
||||||
|
|
||||||
class MakerTVIE(InfoExtractor):
|
class MakerTVIE(InfoExtractor):
|
||||||
_VALID_URL = r'https?://(?:(?:www\.)?maker\.tv/(?:[^/]+/)*video|makerplayer.com/embed/maker)/(?P<id>[a-zA-Z0-9]{12})'
|
_VALID_URL = r'https?://(?:(?:www\.)?maker\.tv/(?:[^/]+/)*video|makerplayer\.com/embed/maker)/(?P<id>[a-zA-Z0-9]{12})'
|
||||||
_TEST = {
|
_TEST = {
|
||||||
'url': 'http://www.maker.tv/video/Fh3QgymL9gsc',
|
'url': 'http://www.maker.tv/video/Fh3QgymL9gsc',
|
||||||
'md5': 'ca237a53a8eb20b6dc5bd60564d4ab3e',
|
'md5': 'ca237a53a8eb20b6dc5bd60564d4ab3e',
|
||||||
|
@ -22,7 +22,7 @@ class MangomoloBaseIE(InfoExtractor):
|
|||||||
|
|
||||||
format_url = self._html_search_regex(
|
format_url = self._html_search_regex(
|
||||||
[
|
[
|
||||||
r'file\s*:\s*"(https?://[^"]+?/playlist.m3u8)',
|
r'file\s*:\s*"(https?://[^"]+?/playlist\.m3u8)',
|
||||||
r'<a[^>]+href="(rtsp://[^"]+)"'
|
r'<a[^>]+href="(rtsp://[^"]+)"'
|
||||||
], webpage, 'format url')
|
], webpage, 'format url')
|
||||||
formats = self._extract_wowza_formats(
|
formats = self._extract_wowza_formats(
|
||||||
|
@ -2,19 +2,18 @@ from __future__ import unicode_literals
|
|||||||
|
|
||||||
import re
|
import re
|
||||||
|
|
||||||
from .common import InfoExtractor
|
from .gigya import GigyaBaseIE
|
||||||
|
|
||||||
from ..compat import compat_str
|
from ..compat import compat_str
|
||||||
from ..utils import (
|
from ..utils import (
|
||||||
ExtractorError,
|
|
||||||
int_or_none,
|
int_or_none,
|
||||||
parse_duration,
|
parse_duration,
|
||||||
try_get,
|
try_get,
|
||||||
unified_timestamp,
|
unified_timestamp,
|
||||||
urlencode_postdata,
|
|
||||||
)
|
)
|
||||||
|
|
||||||
|
|
||||||
class MedialaanIE(InfoExtractor):
|
class MedialaanIE(GigyaBaseIE):
|
||||||
_VALID_URL = r'''(?x)
|
_VALID_URL = r'''(?x)
|
||||||
https?://
|
https?://
|
||||||
(?:www\.|nieuws\.)?
|
(?:www\.|nieuws\.)?
|
||||||
@ -119,15 +118,7 @@ class MedialaanIE(InfoExtractor):
|
|||||||
'password': password,
|
'password': password,
|
||||||
}
|
}
|
||||||
|
|
||||||
auth_info = self._download_json(
|
auth_info = self._gigya_login(auth_data)
|
||||||
'https://accounts.eu1.gigya.com/accounts.login', None,
|
|
||||||
note='Logging in', errnote='Unable to log in',
|
|
||||||
data=urlencode_postdata(auth_data))
|
|
||||||
|
|
||||||
error_message = auth_info.get('errorDetails') or auth_info.get('errorMessage')
|
|
||||||
if error_message:
|
|
||||||
raise ExtractorError(
|
|
||||||
'Unable to login: %s' % error_message, expected=True)
|
|
||||||
|
|
||||||
self._uid = auth_info['UID']
|
self._uid = auth_info['UID']
|
||||||
self._uid_signature = auth_info['UIDSignature']
|
self._uid_signature = auth_info['UIDSignature']
|
||||||
|
@ -18,7 +18,7 @@ class MegaphoneIE(InfoExtractor):
|
|||||||
'id': 'GLT9749789991',
|
'id': 'GLT9749789991',
|
||||||
'ext': 'mp3',
|
'ext': 'mp3',
|
||||||
'title': '#97 What Kind Of Idiot Gets Phished?',
|
'title': '#97 What Kind Of Idiot Gets Phished?',
|
||||||
'thumbnail': 're:^https://.*\.png.*$',
|
'thumbnail': r're:^https://.*\.png.*$',
|
||||||
'duration': 1776.26375,
|
'duration': 1776.26375,
|
||||||
'author': 'Reply All',
|
'author': 'Reply All',
|
||||||
},
|
},
|
||||||
|
@ -11,7 +11,7 @@ from ..utils import (
|
|||||||
|
|
||||||
class MeipaiIE(InfoExtractor):
|
class MeipaiIE(InfoExtractor):
|
||||||
IE_DESC = '美拍'
|
IE_DESC = '美拍'
|
||||||
_VALID_URL = r'https?://(?:www\.)?meipai.com/media/(?P<id>[0-9]+)'
|
_VALID_URL = r'https?://(?:www\.)?meipai\.com/media/(?P<id>[0-9]+)'
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
# regular uploaded video
|
# regular uploaded video
|
||||||
'url': 'http://www.meipai.com/media/531697625',
|
'url': 'http://www.meipai.com/media/531697625',
|
||||||
|
@ -291,7 +291,7 @@ class MixcloudUserIE(MixcloudPlaylistBaseIE):
|
|||||||
functools.partial(
|
functools.partial(
|
||||||
self._tracks_page_func,
|
self._tracks_page_func,
|
||||||
'%s/%s' % (user_id, list_type), video_id, 'list of %s' % list_type),
|
'%s/%s' % (user_id, list_type), video_id, 'list of %s' % list_type),
|
||||||
self._PAGE_SIZE, use_cache=True)
|
self._PAGE_SIZE)
|
||||||
|
|
||||||
return self.playlist_result(
|
return self.playlist_result(
|
||||||
entries, video_id, '%s (%s)' % (username, list_type), description)
|
entries, video_id, '%s (%s)' % (username, list_type), description)
|
||||||
|
@ -258,7 +258,7 @@ class MTVServicesInfoExtractor(InfoExtractor):
|
|||||||
|
|
||||||
if mgid is None or ':' not in mgid:
|
if mgid is None or ':' not in mgid:
|
||||||
mgid = self._search_regex(
|
mgid = self._search_regex(
|
||||||
[r'data-mgid="(.*?)"', r'swfobject.embedSWF\(".*?(mgid:.*?)"'],
|
[r'data-mgid="(.*?)"', r'swfobject\.embedSWF\(".*?(mgid:.*?)"'],
|
||||||
webpage, 'mgid', default=None)
|
webpage, 'mgid', default=None)
|
||||||
|
|
||||||
if not mgid:
|
if not mgid:
|
||||||
|
@ -1,177 +0,0 @@
|
|||||||
from __future__ import unicode_literals
|
|
||||||
|
|
||||||
import binascii
|
|
||||||
import base64
|
|
||||||
import hashlib
|
|
||||||
import re
|
|
||||||
import json
|
|
||||||
|
|
||||||
from .common import InfoExtractor
|
|
||||||
from ..compat import (
|
|
||||||
compat_ord,
|
|
||||||
compat_urllib_parse_unquote,
|
|
||||||
compat_urllib_parse_urlencode,
|
|
||||||
)
|
|
||||||
from ..utils import (
|
|
||||||
ExtractorError,
|
|
||||||
sanitized_Request,
|
|
||||||
)
|
|
||||||
|
|
||||||
|
|
||||||
class MyVideoIE(InfoExtractor):
|
|
||||||
_WORKING = False
|
|
||||||
_VALID_URL = r'https?://(?:www\.)?myvideo\.de/(?:[^/]+/)?watch/(?P<id>[0-9]+)/[^?/]+.*'
|
|
||||||
IE_NAME = 'myvideo'
|
|
||||||
_TEST = {
|
|
||||||
'url': 'http://www.myvideo.de/watch/8229274/bowling_fail_or_win',
|
|
||||||
'md5': '2d2753e8130479ba2cb7e0a37002053e',
|
|
||||||
'info_dict': {
|
|
||||||
'id': '8229274',
|
|
||||||
'ext': 'flv',
|
|
||||||
'title': 'bowling-fail-or-win',
|
|
||||||
}
|
|
||||||
}
|
|
||||||
|
|
||||||
# Original Code from: https://github.com/dersphere/plugin.video.myvideo_de.git
|
|
||||||
# Released into the Public Domain by Tristan Fischer on 2013-05-19
|
|
||||||
# https://github.com/rg3/youtube-dl/pull/842
|
|
||||||
def __rc4crypt(self, data, key):
|
|
||||||
x = 0
|
|
||||||
box = list(range(256))
|
|
||||||
for i in list(range(256)):
|
|
||||||
x = (x + box[i] + compat_ord(key[i % len(key)])) % 256
|
|
||||||
box[i], box[x] = box[x], box[i]
|
|
||||||
x = 0
|
|
||||||
y = 0
|
|
||||||
out = ''
|
|
||||||
for char in data:
|
|
||||||
x = (x + 1) % 256
|
|
||||||
y = (y + box[x]) % 256
|
|
||||||
box[x], box[y] = box[y], box[x]
|
|
||||||
out += chr(compat_ord(char) ^ box[(box[x] + box[y]) % 256])
|
|
||||||
return out
|
|
||||||
|
|
||||||
def __md5(self, s):
|
|
||||||
return hashlib.md5(s).hexdigest().encode()
|
|
||||||
|
|
||||||
def _real_extract(self, url):
|
|
||||||
mobj = re.match(self._VALID_URL, url)
|
|
||||||
video_id = mobj.group('id')
|
|
||||||
|
|
||||||
GK = (
|
|
||||||
b'WXpnME1EZGhNRGhpTTJNM01XVmhOREU0WldNNVpHTTJOakpt'
|
|
||||||
b'TW1FMU5tVTBNR05pWkRaa05XRXhNVFJoWVRVd1ptSXhaVEV3'
|
|
||||||
b'TnpsbA0KTVRkbU1tSTRNdz09'
|
|
||||||
)
|
|
||||||
|
|
||||||
# Get video webpage
|
|
||||||
webpage_url = 'http://www.myvideo.de/watch/%s' % video_id
|
|
||||||
webpage = self._download_webpage(webpage_url, video_id)
|
|
||||||
|
|
||||||
mobj = re.search('source src=\'(.+?)[.]([^.]+)\'', webpage)
|
|
||||||
if mobj is not None:
|
|
||||||
self.report_extraction(video_id)
|
|
||||||
video_url = mobj.group(1) + '.flv'
|
|
||||||
|
|
||||||
video_title = self._html_search_regex('<title>([^<]+)</title>',
|
|
||||||
webpage, 'title')
|
|
||||||
|
|
||||||
return {
|
|
||||||
'id': video_id,
|
|
||||||
'url': video_url,
|
|
||||||
'title': video_title,
|
|
||||||
}
|
|
||||||
|
|
||||||
mobj = re.search(r'data-video-service="/service/data/video/%s/config' % video_id, webpage)
|
|
||||||
if mobj is not None:
|
|
||||||
request = sanitized_Request('http://www.myvideo.de/service/data/video/%s/config' % video_id, '')
|
|
||||||
response = self._download_webpage(request, video_id,
|
|
||||||
'Downloading video info')
|
|
||||||
info = json.loads(base64.b64decode(response).decode('utf-8'))
|
|
||||||
return {
|
|
||||||
'id': video_id,
|
|
||||||
'title': info['title'],
|
|
||||||
'url': info['streaming_url'].replace('rtmpe', 'rtmpt'),
|
|
||||||
'play_path': info['filename'],
|
|
||||||
'ext': 'flv',
|
|
||||||
'thumbnail': info['thumbnail'][0]['url'],
|
|
||||||
}
|
|
||||||
|
|
||||||
# try encxml
|
|
||||||
mobj = re.search('var flashvars={(.+?)}', webpage)
|
|
||||||
if mobj is None:
|
|
||||||
raise ExtractorError('Unable to extract video')
|
|
||||||
|
|
||||||
params = {}
|
|
||||||
encxml = ''
|
|
||||||
sec = mobj.group(1)
|
|
||||||
for (a, b) in re.findall('(.+?):\'(.+?)\',?', sec):
|
|
||||||
if not a == '_encxml':
|
|
||||||
params[a] = b
|
|
||||||
else:
|
|
||||||
encxml = compat_urllib_parse_unquote(b)
|
|
||||||
if not params.get('domain'):
|
|
||||||
params['domain'] = 'www.myvideo.de'
|
|
||||||
xmldata_url = '%s?%s' % (encxml, compat_urllib_parse_urlencode(params))
|
|
||||||
if 'flash_playertype=MTV' in xmldata_url:
|
|
||||||
self._downloader.report_warning('avoiding MTV player')
|
|
||||||
xmldata_url = (
|
|
||||||
'http://www.myvideo.de/dynamic/get_player_video_xml.php'
|
|
||||||
'?flash_playertype=D&ID=%s&_countlimit=4&autorun=yes'
|
|
||||||
) % video_id
|
|
||||||
|
|
||||||
# get enc data
|
|
||||||
enc_data = self._download_webpage(xmldata_url, video_id).split('=')[1]
|
|
||||||
enc_data_b = binascii.unhexlify(enc_data)
|
|
||||||
sk = self.__md5(
|
|
||||||
base64.b64decode(base64.b64decode(GK)) +
|
|
||||||
self.__md5(
|
|
||||||
str(video_id).encode('utf-8')
|
|
||||||
)
|
|
||||||
)
|
|
||||||
dec_data = self.__rc4crypt(enc_data_b, sk)
|
|
||||||
|
|
||||||
# extracting infos
|
|
||||||
self.report_extraction(video_id)
|
|
||||||
|
|
||||||
video_url = None
|
|
||||||
mobj = re.search('connectionurl=\'(.*?)\'', dec_data)
|
|
||||||
if mobj:
|
|
||||||
video_url = compat_urllib_parse_unquote(mobj.group(1))
|
|
||||||
if 'myvideo2flash' in video_url:
|
|
||||||
self.report_warning(
|
|
||||||
'Rewriting URL to use unencrypted rtmp:// ...',
|
|
||||||
video_id)
|
|
||||||
video_url = video_url.replace('rtmpe://', 'rtmp://')
|
|
||||||
|
|
||||||
if not video_url:
|
|
||||||
# extract non rtmp videos
|
|
||||||
mobj = re.search('path=\'(http.*?)\' source=\'(.*?)\'', dec_data)
|
|
||||||
if mobj is None:
|
|
||||||
raise ExtractorError('unable to extract url')
|
|
||||||
video_url = compat_urllib_parse_unquote(mobj.group(1)) + compat_urllib_parse_unquote(mobj.group(2))
|
|
||||||
|
|
||||||
video_file = self._search_regex('source=\'(.*?)\'', dec_data, 'video file')
|
|
||||||
video_file = compat_urllib_parse_unquote(video_file)
|
|
||||||
|
|
||||||
if not video_file.endswith('f4m'):
|
|
||||||
ppath, prefix = video_file.split('.')
|
|
||||||
video_playpath = '%s:%s' % (prefix, ppath)
|
|
||||||
else:
|
|
||||||
video_playpath = ''
|
|
||||||
|
|
||||||
video_swfobj = self._search_regex(r'swfobject.embedSWF\(\'(.+?)\'', webpage, 'swfobj')
|
|
||||||
video_swfobj = compat_urllib_parse_unquote(video_swfobj)
|
|
||||||
|
|
||||||
video_title = self._html_search_regex("<h1(?: class='globalHd')?>(.*?)</h1>",
|
|
||||||
webpage, 'title')
|
|
||||||
|
|
||||||
return {
|
|
||||||
'id': video_id,
|
|
||||||
'url': video_url,
|
|
||||||
'tc_url': video_url,
|
|
||||||
'title': video_title,
|
|
||||||
'ext': 'flv',
|
|
||||||
'play_path': video_playpath,
|
|
||||||
'player_url': video_swfobj,
|
|
||||||
}
|
|
@ -111,7 +111,7 @@ class NationalGeographicIE(ThePlatformIE, AdobePassIE):
|
|||||||
release_url = self._search_regex(
|
release_url = self._search_regex(
|
||||||
r'video_auth_playlist_url\s*=\s*"([^"]+)"',
|
r'video_auth_playlist_url\s*=\s*"([^"]+)"',
|
||||||
webpage, 'release url')
|
webpage, 'release url')
|
||||||
theplatform_path = self._search_regex(r'https?://link.theplatform.com/s/([^?]+)', release_url, 'theplatform path')
|
theplatform_path = self._search_regex(r'https?://link\.theplatform\.com/s/([^?]+)', release_url, 'theplatform path')
|
||||||
video_id = theplatform_path.split('/')[-1]
|
video_id = theplatform_path.split('/')[-1]
|
||||||
query = {
|
query = {
|
||||||
'mbr': 'true',
|
'mbr': 'true',
|
||||||
|
@ -43,7 +43,7 @@ class NaverIE(InfoExtractor):
|
|||||||
video_id = self._match_id(url)
|
video_id = self._match_id(url)
|
||||||
webpage = self._download_webpage(url, video_id)
|
webpage = self._download_webpage(url, video_id)
|
||||||
|
|
||||||
m_id = re.search(r'var rmcPlayer = new nhn.rmcnmv.RMCVideoPlayer\("(.+?)", "(.+?)"',
|
m_id = re.search(r'var rmcPlayer = new nhn\.rmcnmv\.RMCVideoPlayer\("(.+?)", "(.+?)"',
|
||||||
webpage)
|
webpage)
|
||||||
if m_id is None:
|
if m_id is None:
|
||||||
error = self._html_search_regex(
|
error = self._html_search_regex(
|
||||||
|
@ -122,7 +122,7 @@ class NBAIE(TurnerBaseIE):
|
|||||||
playlist_title = self._og_search_title(webpage, fatal=False)
|
playlist_title = self._og_search_title(webpage, fatal=False)
|
||||||
entries = OnDemandPagedList(
|
entries = OnDemandPagedList(
|
||||||
functools.partial(self._fetch_page, team, video_id),
|
functools.partial(self._fetch_page, team, video_id),
|
||||||
self._PAGE_SIZE, use_cache=True)
|
self._PAGE_SIZE)
|
||||||
|
|
||||||
return self.playlist_result(entries, team, playlist_title)
|
return self.playlist_result(entries, team, playlist_title)
|
||||||
|
|
||||||
|
@ -15,7 +15,7 @@ from ..utils import (
|
|||||||
|
|
||||||
|
|
||||||
class NBCIE(AdobePassIE):
|
class NBCIE(AdobePassIE):
|
||||||
_VALID_URL = r'https?(?P<permalink>://(?:www\.)?nbc\.com/[^/]+/video/[^/]+/(?P<id>n?\d+))'
|
_VALID_URL = r'https?(?P<permalink>://(?:www\.)?nbc\.com/(?:classic-tv/)?[^/]+/video/[^/]+/(?P<id>n?\d+))'
|
||||||
|
|
||||||
_TESTS = [
|
_TESTS = [
|
||||||
{
|
{
|
||||||
@ -67,7 +67,11 @@ class NBCIE(AdobePassIE):
|
|||||||
'skip_download': True,
|
'skip_download': True,
|
||||||
},
|
},
|
||||||
'skip': 'Only works from US',
|
'skip': 'Only works from US',
|
||||||
}
|
},
|
||||||
|
{
|
||||||
|
'url': 'https://www.nbc.com/classic-tv/charles-in-charge/video/charles-in-charge-pilot/n3310',
|
||||||
|
'only_matching': True,
|
||||||
|
},
|
||||||
]
|
]
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
|
@ -1,45 +1,106 @@
|
|||||||
|
# coding: utf-8
|
||||||
from __future__ import unicode_literals
|
from __future__ import unicode_literals
|
||||||
|
|
||||||
from .common import InfoExtractor
|
from .common import InfoExtractor
|
||||||
|
from ..compat import (
|
||||||
|
compat_urllib_parse_unquote_plus
|
||||||
|
)
|
||||||
from ..utils import (
|
from ..utils import (
|
||||||
int_or_none,
|
parse_duration,
|
||||||
remove_end,
|
remove_end,
|
||||||
unified_strdate,
|
unified_strdate,
|
||||||
|
urljoin
|
||||||
)
|
)
|
||||||
|
|
||||||
|
|
||||||
class NDTVIE(InfoExtractor):
|
class NDTVIE(InfoExtractor):
|
||||||
_VALID_URL = r'https?://(?:www\.)?ndtv\.com/video/(?:[^/]+/)+[^/?^&]+-(?P<id>\d+)'
|
_VALID_URL = r'https?://(?:[^/]+\.)?ndtv\.com/(?:[^/]+/)*videos?/?(?:[^/]+/)*[^/?^&]+-(?P<id>\d+)'
|
||||||
|
|
||||||
_TEST = {
|
_TESTS = [
|
||||||
'url': 'http://www.ndtv.com/video/news/news/ndtv-exclusive-don-t-need-character-certificate-from-rahul-gandhi-says-arvind-kejriwal-300710',
|
{
|
||||||
'md5': '39f992dbe5fb531c395d8bbedb1e5e88',
|
'url': 'https://khabar.ndtv.com/video/show/prime-time/prime-time-ill-system-and-poor-education-468818',
|
||||||
|
'md5': '78efcf3880ef3fd9b83d405ca94a38eb',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '300710',
|
'id': '468818',
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
'title': "NDTV exclusive: Don't need character certificate from Rahul Gandhi, says Arvind Kejriwal",
|
'title': "प्राइम टाइम: सिस्टम बीमार, स्कूल बदहाल",
|
||||||
'description': 'md5:ab2d4b4a6056c5cb4caa6d729deabf02',
|
'description': 'md5:f410512f1b49672e5695dea16ef2731d',
|
||||||
'upload_date': '20131208',
|
'upload_date': '20170928',
|
||||||
'duration': 1327,
|
'duration': 2218,
|
||||||
'thumbnail': r're:https?://.*\.jpg',
|
'thumbnail': r're:https?://.*\.jpg',
|
||||||
},
|
|
||||||
}
|
}
|
||||||
|
},
|
||||||
|
{
|
||||||
|
# __filename is url
|
||||||
|
'url': 'http://movies.ndtv.com/videos/cracker-free-diwali-wishes-from-karan-johar-kriti-sanon-other-stars-470304',
|
||||||
|
'md5': 'f1d709352305b44443515ac56b45aa46',
|
||||||
|
'info_dict': {
|
||||||
|
'id': '470304',
|
||||||
|
'ext': 'mp4',
|
||||||
|
'title': "Cracker-Free Diwali Wishes From Karan Johar, Kriti Sanon & Other Stars",
|
||||||
|
'description': 'md5:f115bba1adf2f6433fa7c1ade5feb465',
|
||||||
|
'upload_date': '20171019',
|
||||||
|
'duration': 137,
|
||||||
|
'thumbnail': r're:https?://.*\.jpg',
|
||||||
|
}
|
||||||
|
},
|
||||||
|
{
|
||||||
|
'url': 'https://www.ndtv.com/video/news/news/delhi-s-air-quality-status-report-after-diwali-is-very-poor-470372',
|
||||||
|
'only_matching': True
|
||||||
|
},
|
||||||
|
{
|
||||||
|
'url': 'https://auto.ndtv.com/videos/the-cnb-daily-october-13-2017-469935',
|
||||||
|
'only_matching': True
|
||||||
|
},
|
||||||
|
{
|
||||||
|
'url': 'https://sports.ndtv.com/cricket/videos/2nd-t20i-rock-thrown-at-australia-cricket-team-bus-after-win-over-india-469764',
|
||||||
|
'only_matching': True
|
||||||
|
},
|
||||||
|
{
|
||||||
|
'url': 'http://gadgets.ndtv.com/videos/uncharted-the-lost-legacy-review-465568',
|
||||||
|
'only_matching': True
|
||||||
|
},
|
||||||
|
{
|
||||||
|
'url': 'http://profit.ndtv.com/videos/news/video-indian-economy-on-very-solid-track-international-monetary-fund-chief-470040',
|
||||||
|
'only_matching': True
|
||||||
|
},
|
||||||
|
{
|
||||||
|
'url': 'http://food.ndtv.com/video-basil-seeds-coconut-porridge-419083',
|
||||||
|
'only_matching': True
|
||||||
|
},
|
||||||
|
{
|
||||||
|
'url': 'https://doctor.ndtv.com/videos/top-health-stories-of-the-week-467396',
|
||||||
|
'only_matching': True
|
||||||
|
},
|
||||||
|
{
|
||||||
|
'url': 'https://swirlster.ndtv.com/video/how-to-make-friends-at-work-469324',
|
||||||
|
'only_matching': True
|
||||||
|
}
|
||||||
|
]
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
video_id = self._match_id(url)
|
video_id = self._match_id(url)
|
||||||
webpage = self._download_webpage(url, video_id)
|
webpage = self._download_webpage(url, video_id)
|
||||||
|
|
||||||
title = remove_end(self._og_search_title(webpage), ' - NDTV')
|
# '__title' does not contain extra words such as sub-site name, "Video" etc.
|
||||||
|
title = compat_urllib_parse_unquote_plus(
|
||||||
|
self._search_regex(r"__title\s*=\s*'([^']+)'", webpage, 'title', default=None) or
|
||||||
|
self._og_search_title(webpage))
|
||||||
|
|
||||||
filename = self._search_regex(
|
filename = self._search_regex(
|
||||||
r"__filename='([^']+)'", webpage, 'video filename')
|
r"(?:__)?filename\s*[:=]\s*'([^']+)'", webpage, 'video filename')
|
||||||
video_url = 'http://bitcast-b.bitgravity.com/ndtvod/23372/ndtv/%s' % filename
|
# in "movies" sub-site pages, filename is URL
|
||||||
|
video_url = urljoin('https://ndtvod.bc-ssl.cdn.bitgravity.com/23372/ndtv/', filename.lstrip('/'))
|
||||||
|
|
||||||
duration = int_or_none(self._search_regex(
|
# "doctor" sub-site has MM:SS format
|
||||||
r"__duration='([^']+)'", webpage, 'duration', fatal=False))
|
duration = parse_duration(self._search_regex(
|
||||||
|
r"(?:__)?duration\s*[:=]\s*'([^']+)'", webpage, 'duration', fatal=False))
|
||||||
|
|
||||||
|
# "sports", "doctor", "swirlster" sub-sites don't have 'publish-date'
|
||||||
upload_date = unified_strdate(self._html_search_meta(
|
upload_date = unified_strdate(self._html_search_meta(
|
||||||
'publish-date', webpage, 'upload date', fatal=False))
|
'publish-date', webpage, 'upload date', default=None) or self._html_search_meta(
|
||||||
|
'uploadDate', webpage, 'upload date', default=None) or self._search_regex(
|
||||||
|
r'datePublished"\s*:\s*"([^"]+)"', webpage, 'upload date', fatal=False))
|
||||||
|
|
||||||
description = remove_end(self._og_search_description(webpage), ' (Read more)')
|
description = remove_end(self._og_search_description(webpage), ' (Read more)')
|
||||||
|
|
||||||
|
@ -18,7 +18,13 @@ from ..utils import (
|
|||||||
|
|
||||||
|
|
||||||
class NexxIE(InfoExtractor):
|
class NexxIE(InfoExtractor):
|
||||||
_VALID_URL = r'https?://api\.nexx(?:\.cloud|cdn\.com)/v3/(?P<domain_id>\d+)/videos/byid/(?P<id>\d+)'
|
_VALID_URL = r'''(?x)
|
||||||
|
(?:
|
||||||
|
https?://api\.nexx(?:\.cloud|cdn\.com)/v3/(?P<domain_id>\d+)/videos/byid/|
|
||||||
|
nexx:(?P<domain_id_s>\d+):
|
||||||
|
)
|
||||||
|
(?P<id>\d+)
|
||||||
|
'''
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
# movie
|
# movie
|
||||||
'url': 'https://api.nexx.cloud/v3/748/videos/byid/128907',
|
'url': 'https://api.nexx.cloud/v3/748/videos/byid/128907',
|
||||||
@ -62,8 +68,18 @@ class NexxIE(InfoExtractor):
|
|||||||
}, {
|
}, {
|
||||||
'url': 'https://api.nexxcdn.com/v3/748/videos/byid/128907',
|
'url': 'https://api.nexxcdn.com/v3/748/videos/byid/128907',
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'nexx:748:128907',
|
||||||
|
'only_matching': True,
|
||||||
}]
|
}]
|
||||||
|
|
||||||
|
@staticmethod
|
||||||
|
def _extract_domain_id(webpage):
|
||||||
|
mobj = re.search(
|
||||||
|
r'<script\b[^>]+\bsrc=["\'](?:https?:)?//require\.nexx(?:\.cloud|cdn\.com)/(?P<id>\d+)',
|
||||||
|
webpage)
|
||||||
|
return mobj.group('id') if mobj else None
|
||||||
|
|
||||||
@staticmethod
|
@staticmethod
|
||||||
def _extract_urls(webpage):
|
def _extract_urls(webpage):
|
||||||
# Reference:
|
# Reference:
|
||||||
@ -72,11 +88,8 @@ class NexxIE(InfoExtractor):
|
|||||||
entries = []
|
entries = []
|
||||||
|
|
||||||
# JavaScript Integration
|
# JavaScript Integration
|
||||||
mobj = re.search(
|
domain_id = NexxIE._extract_domain_id(webpage)
|
||||||
r'<script\b[^>]+\bsrc=["\']https?://require\.nexx(?:\.cloud|cdn\.com)/(?P<id>\d+)',
|
if domain_id:
|
||||||
webpage)
|
|
||||||
if mobj:
|
|
||||||
domain_id = mobj.group('id')
|
|
||||||
for video_id in re.findall(
|
for video_id in re.findall(
|
||||||
r'(?is)onPLAYReady.+?_play\.init\s*\(.+?\s*,\s*["\']?(\d+)',
|
r'(?is)onPLAYReady.+?_play\.init\s*\(.+?\s*,\s*["\']?(\d+)',
|
||||||
webpage):
|
webpage):
|
||||||
@ -112,7 +125,8 @@ class NexxIE(InfoExtractor):
|
|||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
mobj = re.match(self._VALID_URL, url)
|
mobj = re.match(self._VALID_URL, url)
|
||||||
domain_id, video_id = mobj.group('domain_id', 'id')
|
domain_id = mobj.group('domain_id') or mobj.group('domain_id_s')
|
||||||
|
video_id = mobj.group('id')
|
||||||
|
|
||||||
# Reverse engineered from JS code (see getDeviceID function)
|
# Reverse engineered from JS code (see getDeviceID function)
|
||||||
device_id = '%d:%d:%d%d' % (
|
device_id = '%d:%d:%d%d' % (
|
||||||
|
@ -75,7 +75,7 @@ class NickIE(MTVServicesInfoExtractor):
|
|||||||
|
|
||||||
class NickDeIE(MTVServicesInfoExtractor):
|
class NickDeIE(MTVServicesInfoExtractor):
|
||||||
IE_NAME = 'nick.de'
|
IE_NAME = 'nick.de'
|
||||||
_VALID_URL = r'https?://(?:www\.)?(?P<host>nick\.(?:de|com\.pl)|nickelodeon\.(?:nl|at))/[^/]+/(?:[^/]+/)*(?P<id>[^/?#&]+)'
|
_VALID_URL = r'https?://(?:www\.)?(?P<host>nick\.(?:de|com\.pl|ch)|nickelodeon\.(?:nl|be|at|dk|no|se))/[^/]+/(?:[^/]+/)*(?P<id>[^/?#&]+)'
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
'url': 'http://www.nick.de/playlist/3773-top-videos/videos/episode/17306-zu-wasser-und-zu-land-rauchende-erdnusse',
|
'url': 'http://www.nick.de/playlist/3773-top-videos/videos/episode/17306-zu-wasser-und-zu-land-rauchende-erdnusse',
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
@ -91,6 +91,21 @@ class NickDeIE(MTVServicesInfoExtractor):
|
|||||||
}, {
|
}, {
|
||||||
'url': 'http://www.nick.com.pl/seriale/474-spongebob-kanciastoporty/wideo/17412-teatr-to-jest-to-rodeo-oszolom',
|
'url': 'http://www.nick.com.pl/seriale/474-spongebob-kanciastoporty/wideo/17412-teatr-to-jest-to-rodeo-oszolom',
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://www.nickelodeon.no/program/2626-bulderhuset/videoer/90947-femteklasse-veronica-vs-vanzilla',
|
||||||
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://www.nickelodeon.dk/serier/2626-hojs-hus/videoer/761-tissepause',
|
||||||
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://www.nickelodeon.se/serier/2626-lugn-i-stormen/videos/998-',
|
||||||
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://www.nick.ch/shows/2304-adventure-time-abenteuerzeit-mit-finn-und-jake',
|
||||||
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://www.nickelodeon.be/afspeellijst/4530-top-videos/videos/episode/73917-inval-broodschapper-lariekoek-arie',
|
||||||
|
'only_matching': True,
|
||||||
}]
|
}]
|
||||||
|
|
||||||
def _extract_mrss_url(self, webpage, host):
|
def _extract_mrss_url(self, webpage, host):
|
||||||
@ -132,13 +147,28 @@ class NickNightIE(NickDeIE):
|
|||||||
|
|
||||||
class NickRuIE(MTVServicesInfoExtractor):
|
class NickRuIE(MTVServicesInfoExtractor):
|
||||||
IE_NAME = 'nickelodeonru'
|
IE_NAME = 'nickelodeonru'
|
||||||
_VALID_URL = r'https?://(?:www\.)nickelodeon\.ru/(?:playlist|shows|videos)/(?:[^/]+/)*(?P<id>[^/?#&]+)'
|
_VALID_URL = r'https?://(?:www\.)nickelodeon\.(?:ru|fr|es|pt|ro|hu)/[^/]+/(?:[^/]+/)*(?P<id>[^/?#&]+)'
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
'url': 'http://www.nickelodeon.ru/shows/henrydanger/videos/episodes/3-sezon-15-seriya-licenziya-na-polyot/pmomfb#playlist/7airc6',
|
'url': 'http://www.nickelodeon.ru/shows/henrydanger/videos/episodes/3-sezon-15-seriya-licenziya-na-polyot/pmomfb#playlist/7airc6',
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
}, {
|
}, {
|
||||||
'url': 'http://www.nickelodeon.ru/videos/smotri-na-nickelodeon-v-iyule/g9hvh7',
|
'url': 'http://www.nickelodeon.ru/videos/smotri-na-nickelodeon-v-iyule/g9hvh7',
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://www.nickelodeon.fr/programmes/bob-l-eponge/videos/le-marathon-de-booh-kini-bottom-mardi-31-octobre/nfn7z0',
|
||||||
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://www.nickelodeon.es/videos/nickelodeon-consejos-tortitas/f7w7xy',
|
||||||
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://www.nickelodeon.pt/series/spongebob-squarepants/videos/a-bolha-de-tinta-gigante/xutq1b',
|
||||||
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://www.nickelodeon.ro/emisiuni/shimmer-si-shine/video/nahal-din-bomboane/uw5u2k',
|
||||||
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://www.nickelodeon.hu/musorok/spongyabob-kockanadrag/videok/episodes/buborekfujas-az-elszakadt-nadrag/q57iob#playlist/k6te4y',
|
||||||
|
'only_matching': True,
|
||||||
}]
|
}]
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
|
@ -40,7 +40,7 @@ class NiconicoIE(InfoExtractor):
|
|||||||
'uploader': 'takuya0301',
|
'uploader': 'takuya0301',
|
||||||
'uploader_id': '2698420',
|
'uploader_id': '2698420',
|
||||||
'upload_date': '20131123',
|
'upload_date': '20131123',
|
||||||
'timestamp': 1385182762,
|
'timestamp': int, # timestamp is unstable
|
||||||
'description': '(c) copyright 2008, Blender Foundation / www.bigbuckbunny.org',
|
'description': '(c) copyright 2008, Blender Foundation / www.bigbuckbunny.org',
|
||||||
'duration': 33,
|
'duration': 33,
|
||||||
'view_count': int,
|
'view_count': int,
|
||||||
@ -115,8 +115,8 @@ class NiconicoIE(InfoExtractor):
|
|||||||
'skip': 'Requires an account',
|
'skip': 'Requires an account',
|
||||||
}, {
|
}, {
|
||||||
# "New" HTML5 video
|
# "New" HTML5 video
|
||||||
|
# md5 is unstable
|
||||||
'url': 'http://www.nicovideo.jp/watch/sm31464864',
|
'url': 'http://www.nicovideo.jp/watch/sm31464864',
|
||||||
'md5': '351647b4917660986dc0fa8864085135',
|
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': 'sm31464864',
|
'id': 'sm31464864',
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
@ -124,7 +124,7 @@ class NiconicoIE(InfoExtractor):
|
|||||||
'description': 'md5:e52974af9a96e739196b2c1ca72b5feb',
|
'description': 'md5:e52974af9a96e739196b2c1ca72b5feb',
|
||||||
'timestamp': 1498514060,
|
'timestamp': 1498514060,
|
||||||
'upload_date': '20170626',
|
'upload_date': '20170626',
|
||||||
'uploader': 'ゲス',
|
'uploader': 'ゲスト',
|
||||||
'uploader_id': '40826363',
|
'uploader_id': '40826363',
|
||||||
'thumbnail': r're:https?://.*',
|
'thumbnail': r're:https?://.*',
|
||||||
'duration': 198,
|
'duration': 198,
|
||||||
@ -132,6 +132,25 @@ class NiconicoIE(InfoExtractor):
|
|||||||
'comment_count': int,
|
'comment_count': int,
|
||||||
},
|
},
|
||||||
'skip': 'Requires an account',
|
'skip': 'Requires an account',
|
||||||
|
}, {
|
||||||
|
# Video without owner
|
||||||
|
'url': 'http://www.nicovideo.jp/watch/sm18238488',
|
||||||
|
'md5': 'd265680a1f92bdcbbd2a507fc9e78a9e',
|
||||||
|
'info_dict': {
|
||||||
|
'id': 'sm18238488',
|
||||||
|
'ext': 'mp4',
|
||||||
|
'title': '【実写版】ミュータントタートルズ',
|
||||||
|
'description': 'md5:15df8988e47a86f9e978af2064bf6d8e',
|
||||||
|
'timestamp': 1341160408,
|
||||||
|
'upload_date': '20120701',
|
||||||
|
'uploader': None,
|
||||||
|
'uploader_id': None,
|
||||||
|
'thumbnail': r're:https?://.*',
|
||||||
|
'duration': 5271,
|
||||||
|
'view_count': int,
|
||||||
|
'comment_count': int,
|
||||||
|
},
|
||||||
|
'skip': 'Requires an account',
|
||||||
}, {
|
}, {
|
||||||
'url': 'http://sp.nicovideo.jp/watch/sm28964488?ss_pos=1&cp_in=wt_tg',
|
'url': 'http://sp.nicovideo.jp/watch/sm28964488?ss_pos=1&cp_in=wt_tg',
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
@ -395,7 +414,9 @@ class NiconicoIE(InfoExtractor):
|
|||||||
|
|
||||||
webpage_url = get_video_info('watch_url') or url
|
webpage_url = get_video_info('watch_url') or url
|
||||||
|
|
||||||
owner = api_data.get('owner', {})
|
# Note: cannot use api_data.get('owner', {}) because owner may be set to "null"
|
||||||
|
# in the JSON, which will cause None to be returned instead of {}.
|
||||||
|
owner = try_get(api_data, lambda x: x.get('owner'), dict) or {}
|
||||||
uploader_id = get_video_info(['ch_id', 'user_id']) or owner.get('id')
|
uploader_id = get_video_info(['ch_id', 'user_id']) or owner.get('id')
|
||||||
uploader = get_video_info(['ch_name', 'user_nickname']) or owner.get('nickname')
|
uploader = get_video_info(['ch_name', 'user_nickname']) or owner.get('nickname')
|
||||||
|
|
||||||
|
@ -469,7 +469,7 @@ class SchoolTVIE(NPODataMidEmbedIE):
|
|||||||
|
|
||||||
class HetKlokhuisIE(NPODataMidEmbedIE):
|
class HetKlokhuisIE(NPODataMidEmbedIE):
|
||||||
IE_NAME = 'hetklokhuis'
|
IE_NAME = 'hetklokhuis'
|
||||||
_VALID_URL = r'https?://(?:www\.)?hetklokhuis.nl/[^/]+/\d+/(?P<id>[^/?#&]+)'
|
_VALID_URL = r'https?://(?:www\.)?hetklokhuis\.nl/[^/]+/\d+/(?P<id>[^/?#&]+)'
|
||||||
|
|
||||||
_TEST = {
|
_TEST = {
|
||||||
'url': 'http://hetklokhuis.nl/tv-uitzending/3471/Zwaartekrachtsgolven',
|
'url': 'http://hetklokhuis.nl/tv-uitzending/3471/Zwaartekrachtsgolven',
|
||||||
|
@ -7,7 +7,7 @@ from .common import InfoExtractor
|
|||||||
|
|
||||||
|
|
||||||
class OnceIE(InfoExtractor):
|
class OnceIE(InfoExtractor):
|
||||||
_VALID_URL = r'https?://.+?\.unicornmedia\.com/now/[^/]+/[^/]+/(?P<domain_id>[^/]+)/(?P<application_id>[^/]+)/(?:[^/]+/)?(?P<media_item_id>[^/]+)/content\.(?:once|m3u8|mp4)'
|
_VALID_URL = r'https?://.+?\.unicornmedia\.com/now/(?:ads/vmap/)?[^/]+/[^/]+/(?P<domain_id>[^/]+)/(?P<application_id>[^/]+)/(?:[^/]+/)?(?P<media_item_id>[^/]+)/content\.(?:once|m3u8|mp4)'
|
||||||
ADAPTIVE_URL_TEMPLATE = 'http://once.unicornmedia.com/now/master/playlist/%s/%s/%s/content.m3u8'
|
ADAPTIVE_URL_TEMPLATE = 'http://once.unicornmedia.com/now/master/playlist/%s/%s/%s/content.m3u8'
|
||||||
PROGRESSIVE_URL_TEMPLATE = 'http://once.unicornmedia.com/now/media/progressive/%s/%s/%s/%s/content.mp4'
|
PROGRESSIVE_URL_TEMPLATE = 'http://once.unicornmedia.com/now/media/progressive/%s/%s/%s/%s/content.mp4'
|
||||||
|
|
||||||
|
@ -13,11 +13,11 @@ from ..utils import (
|
|||||||
|
|
||||||
|
|
||||||
class OnionStudiosIE(InfoExtractor):
|
class OnionStudiosIE(InfoExtractor):
|
||||||
_VALID_URL = r'https?://(?:www\.)?onionstudios\.com/(?:videos/[^/]+-|embed\?.*\bid=)(?P<id>\d+)(?!-)'
|
_VALID_URL = r'https?://(?:www\.)?onionstudios\.com/(?:video(?:s/[^/]+-|/)|embed\?.*\bid=)(?P<id>\d+)(?!-)'
|
||||||
|
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
'url': 'http://www.onionstudios.com/videos/hannibal-charges-forward-stops-for-a-cocktail-2937',
|
'url': 'http://www.onionstudios.com/videos/hannibal-charges-forward-stops-for-a-cocktail-2937',
|
||||||
'md5': 'e49f947c105b8a78a675a0ee1bddedfe',
|
'md5': '719d1f8c32094b8c33902c17bcae5e34',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '2937',
|
'id': '2937',
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
@ -29,12 +29,15 @@ class OnionStudiosIE(InfoExtractor):
|
|||||||
}, {
|
}, {
|
||||||
'url': 'http://www.onionstudios.com/embed?id=2855&autoplay=true',
|
'url': 'http://www.onionstudios.com/embed?id=2855&autoplay=true',
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://www.onionstudios.com/video/6139.json',
|
||||||
|
'only_matching': True,
|
||||||
}]
|
}]
|
||||||
|
|
||||||
@staticmethod
|
@staticmethod
|
||||||
def _extract_url(webpage):
|
def _extract_url(webpage):
|
||||||
mobj = re.search(
|
mobj = re.search(
|
||||||
r'<iframe[^>]+?src=(["\'])(?P<url>(?:https?:)?//(?:www\.)?onionstudios\.com/embed.+?)\1', webpage)
|
r'(?s)<(?:iframe|bulbs-video)[^>]+?src=(["\'])(?P<url>(?:https?:)?//(?:www\.)?onionstudios\.com/(?:embed.+?|video/\d+\.json))\1', webpage)
|
||||||
if mobj:
|
if mobj:
|
||||||
return mobj.group('url')
|
return mobj.group('url')
|
||||||
|
|
||||||
|
@ -11,7 +11,7 @@ class ParliamentLiveUKIE(InfoExtractor):
|
|||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
'url': 'http://parliamentlive.tv/Event/Index/c1e9d44d-fd6c-4263-b50f-97ed26cc998b',
|
'url': 'http://parliamentlive.tv/Event/Index/c1e9d44d-fd6c-4263-b50f-97ed26cc998b',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': 'c1e9d44d-fd6c-4263-b50f-97ed26cc998b',
|
'id': '1_af9nv9ym',
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
'title': 'Home Affairs Committee',
|
'title': 'Home Affairs Committee',
|
||||||
'uploader_id': 'FFMPEG-01',
|
'uploader_id': 'FFMPEG-01',
|
||||||
@ -28,14 +28,14 @@ class ParliamentLiveUKIE(InfoExtractor):
|
|||||||
webpage = self._download_webpage(
|
webpage = self._download_webpage(
|
||||||
'http://vodplayer.parliamentlive.tv/?mid=' + video_id, video_id)
|
'http://vodplayer.parliamentlive.tv/?mid=' + video_id, video_id)
|
||||||
widget_config = self._parse_json(self._search_regex(
|
widget_config = self._parse_json(self._search_regex(
|
||||||
r'kWidgetConfig\s*=\s*({.+});',
|
r'(?s)kWidgetConfig\s*=\s*({.+});',
|
||||||
webpage, 'kaltura widget config'), video_id)
|
webpage, 'kaltura widget config'), video_id)
|
||||||
kaltura_url = 'kaltura:%s:%s' % (widget_config['wid'][1:], widget_config['entry_id'])
|
kaltura_url = 'kaltura:%s:%s' % (
|
||||||
|
widget_config['wid'][1:], widget_config['entry_id'])
|
||||||
event_title = self._download_json(
|
event_title = self._download_json(
|
||||||
'http://parliamentlive.tv/Event/GetShareVideo/' + video_id, video_id)['event']['title']
|
'http://parliamentlive.tv/Event/GetShareVideo/' + video_id, video_id)['event']['title']
|
||||||
return {
|
return {
|
||||||
'_type': 'url_transparent',
|
'_type': 'url_transparent',
|
||||||
'id': video_id,
|
|
||||||
'title': event_title,
|
'title': event_title,
|
||||||
'description': '',
|
'description': '',
|
||||||
'url': kaltura_url,
|
'url': kaltura_url,
|
||||||
|
@ -187,7 +187,7 @@ class PBSIE(InfoExtractor):
|
|||||||
_VALID_URL = r'''(?x)https?://
|
_VALID_URL = r'''(?x)https?://
|
||||||
(?:
|
(?:
|
||||||
# Direct video URL
|
# Direct video URL
|
||||||
(?:%s)/(?:viralplayer|video)/(?P<id>[0-9]+)/? |
|
(?:%s)/(?:(?:vir|port)alplayer|video)/(?P<id>[0-9]+)(?:[?/]|$) |
|
||||||
# Article with embedded player (or direct video)
|
# Article with embedded player (or direct video)
|
||||||
(?:www\.)?pbs\.org/(?:[^/]+/){1,5}(?P<presumptive_id>[^/]+?)(?:\.html)?/?(?:$|[?\#]) |
|
(?:www\.)?pbs\.org/(?:[^/]+/){1,5}(?P<presumptive_id>[^/]+?)(?:\.html)?/?(?:$|[?\#]) |
|
||||||
# Player
|
# Player
|
||||||
@ -367,6 +367,10 @@ class PBSIE(InfoExtractor):
|
|||||||
{
|
{
|
||||||
'url': 'http://watch.knpb.org/video/2365616055/',
|
'url': 'http://watch.knpb.org/video/2365616055/',
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
|
},
|
||||||
|
{
|
||||||
|
'url': 'https://player.pbs.org/portalplayer/3004638221/?uid=',
|
||||||
|
'only_matching': True,
|
||||||
}
|
}
|
||||||
]
|
]
|
||||||
_ERRORS = {
|
_ERRORS = {
|
||||||
|
@ -14,7 +14,7 @@ from ..utils import (
|
|||||||
|
|
||||||
|
|
||||||
class PornFlipIE(InfoExtractor):
|
class PornFlipIE(InfoExtractor):
|
||||||
_VALID_URL = r'https?://(?:www\.)?pornflip\.com/(?:v|embed)/(?P<id>[0-9A-Za-z]{11})'
|
_VALID_URL = r'https?://(?:www\.)?pornflip\.com/(?:v|embed)/(?P<id>[0-9A-Za-z-]{11})'
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
'url': 'https://www.pornflip.com/v/wz7DfNhMmep',
|
'url': 'https://www.pornflip.com/v/wz7DfNhMmep',
|
||||||
'md5': '98c46639849145ae1fd77af532a9278c',
|
'md5': '98c46639849145ae1fd77af532a9278c',
|
||||||
@ -34,6 +34,12 @@ class PornFlipIE(InfoExtractor):
|
|||||||
}, {
|
}, {
|
||||||
'url': 'https://www.pornflip.com/embed/wz7DfNhMmep',
|
'url': 'https://www.pornflip.com/embed/wz7DfNhMmep',
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'https://www.pornflip.com/v/EkRD6-vS2-s',
|
||||||
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'https://www.pornflip.com/embed/EkRD6-vS2-s',
|
||||||
|
'only_matching': True,
|
||||||
}]
|
}]
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
|
@ -1,5 +1,7 @@
|
|||||||
from __future__ import unicode_literals
|
from __future__ import unicode_literals
|
||||||
|
|
||||||
|
import re
|
||||||
|
|
||||||
from .common import InfoExtractor
|
from .common import InfoExtractor
|
||||||
from ..utils import (
|
from ..utils import (
|
||||||
ExtractorError,
|
ExtractorError,
|
||||||
@ -35,6 +37,8 @@ class RedditIE(InfoExtractor):
|
|||||||
'https://v.redd.it/%s/DASHPlaylist.mpd' % video_id, video_id,
|
'https://v.redd.it/%s/DASHPlaylist.mpd' % video_id, video_id,
|
||||||
mpd_id='dash', fatal=False))
|
mpd_id='dash', fatal=False))
|
||||||
|
|
||||||
|
self._sort_formats(formats)
|
||||||
|
|
||||||
return {
|
return {
|
||||||
'id': video_id,
|
'id': video_id,
|
||||||
'title': video_id,
|
'title': video_id,
|
||||||
@ -43,7 +47,7 @@ class RedditIE(InfoExtractor):
|
|||||||
|
|
||||||
|
|
||||||
class RedditRIE(InfoExtractor):
|
class RedditRIE(InfoExtractor):
|
||||||
_VALID_URL = r'https?://(?:www\.)?reddit\.com/r/[^/]+/comments/(?P<id>[^/]+)'
|
_VALID_URL = r'(?P<url>https?://(?:www\.)?reddit\.com/r/[^/]+/comments/(?P<id>[^/?#&]+))'
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
'url': 'https://www.reddit.com/r/videos/comments/6rrwyj/that_small_heart_attack/',
|
'url': 'https://www.reddit.com/r/videos/comments/6rrwyj/that_small_heart_attack/',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
@ -81,10 +85,13 @@ class RedditRIE(InfoExtractor):
|
|||||||
}]
|
}]
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
|
mobj = re.match(self._VALID_URL, url)
|
||||||
|
url, video_id = mobj.group('url', 'id')
|
||||||
|
|
||||||
video_id = self._match_id(url)
|
video_id = self._match_id(url)
|
||||||
|
|
||||||
data = self._download_json(
|
data = self._download_json(
|
||||||
url + '.json', video_id)[0]['data']['children'][0]['data']
|
url + '/.json', video_id)[0]['data']['children'][0]['data']
|
||||||
|
|
||||||
video_url = data['url']
|
video_url = data['url']
|
||||||
|
|
||||||
|
@ -12,10 +12,10 @@ class RtlNlIE(InfoExtractor):
|
|||||||
IE_NAME = 'rtl.nl'
|
IE_NAME = 'rtl.nl'
|
||||||
IE_DESC = 'rtl.nl and rtlxl.nl'
|
IE_DESC = 'rtl.nl and rtlxl.nl'
|
||||||
_VALID_URL = r'''(?x)
|
_VALID_URL = r'''(?x)
|
||||||
https?://(?:www\.)?
|
https?://(?:(?:www|static)\.)?
|
||||||
(?:
|
(?:
|
||||||
rtlxl\.nl/[^\#]*\#!/[^/]+/|
|
rtlxl\.nl/[^\#]*\#!/[^/]+/|
|
||||||
rtl\.nl/(?:system/videoplayer/(?:[^/]+/)+(?:video_)?embed\.html\b.+?\buuid=|video/)
|
rtl\.nl/(?:(?:system/videoplayer/(?:[^/]+/)+(?:video_)?embed\.html|embed)\b.+?\buuid=|video/)
|
||||||
)
|
)
|
||||||
(?P<id>[0-9a-f-]+)'''
|
(?P<id>[0-9a-f-]+)'''
|
||||||
|
|
||||||
@ -73,6 +73,9 @@ class RtlNlIE(InfoExtractor):
|
|||||||
}, {
|
}, {
|
||||||
'url': 'https://www.rtl.nl/video/c603c9c2-601d-4b5e-8175-64f1e942dc7d/',
|
'url': 'https://www.rtl.nl/video/c603c9c2-601d-4b5e-8175-64f1e942dc7d/',
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'https://static.rtl.nl/embed/?uuid=1a2970fc-5c0b-43ff-9fdc-927e39e6d1bc&autoplay=false&publicatiepunt=rtlnieuwsnl',
|
||||||
|
'only_matching': True,
|
||||||
}]
|
}]
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
|
@ -25,7 +25,7 @@ class RUHDIE(InfoExtractor):
|
|||||||
video_url = self._html_search_regex(
|
video_url = self._html_search_regex(
|
||||||
r'<param name="src" value="([^"]+)"', webpage, 'video url')
|
r'<param name="src" value="([^"]+)"', webpage, 'video url')
|
||||||
title = self._html_search_regex(
|
title = self._html_search_regex(
|
||||||
r'<title>([^<]+) RUHD.ru - Видео Высокого качества №1 в России!</title>',
|
r'<title>([^<]+) RUHD\.ru - Видео Высокого качества №1 в России!</title>',
|
||||||
webpage, 'title')
|
webpage, 'title')
|
||||||
description = self._html_search_regex(
|
description = self._html_search_regex(
|
||||||
r'(?s)<div id="longdesc">(.+?)<span id="showlink">',
|
r'(?s)<div id="longdesc">(.+?)<span id="showlink">',
|
||||||
|
@ -1,60 +1,190 @@
|
|||||||
# coding: utf-8
|
# coding: utf-8
|
||||||
from __future__ import unicode_literals
|
from __future__ import unicode_literals
|
||||||
|
|
||||||
from .adobepass import AdobePassIE
|
import datetime
|
||||||
|
import json
|
||||||
|
import hashlib
|
||||||
|
import hmac
|
||||||
|
import re
|
||||||
|
|
||||||
|
from .common import InfoExtractor
|
||||||
|
from .anvato import AnvatoIE
|
||||||
from ..utils import (
|
from ..utils import (
|
||||||
int_or_none,
|
|
||||||
smuggle_url,
|
smuggle_url,
|
||||||
update_url_query,
|
urlencode_postdata,
|
||||||
|
xpath_text,
|
||||||
)
|
)
|
||||||
|
|
||||||
|
|
||||||
class ScrippsNetworksWatchIE(AdobePassIE):
|
class ScrippsNetworksWatchIE(InfoExtractor):
|
||||||
IE_NAME = 'scrippsnetworks:watch'
|
IE_NAME = 'scrippsnetworks:watch'
|
||||||
_VALID_URL = r'https?://watch\.(?:hgtv|foodnetwork|travelchannel|diynetwork|cookingchanneltv)\.com/player\.[A-Z0-9]+\.html#(?P<id>\d+)'
|
_VALID_URL = r'''(?x)
|
||||||
_TEST = {
|
https?://
|
||||||
'url': 'http://watch.hgtv.com/player.HNT.html#0256538',
|
watch\.
|
||||||
|
(?P<site>hgtv|foodnetwork|travelchannel|diynetwork|cookingchanneltv|geniuskitchen)\.com/
|
||||||
|
(?:
|
||||||
|
player\.[A-Z0-9]+\.html\#|
|
||||||
|
show/(?:[^/]+/){2}|
|
||||||
|
player/
|
||||||
|
)
|
||||||
|
(?P<id>\d+)
|
||||||
|
'''
|
||||||
|
_TESTS = [{
|
||||||
|
'url': 'http://watch.hgtv.com/show/HGTVE/Best-Ever-Treehouses/2241515/Best-Ever-Treehouses/',
|
||||||
'md5': '26545fd676d939954c6808274bdb905a',
|
'md5': '26545fd676d939954c6808274bdb905a',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '0256538',
|
'id': '4173834',
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
'title': 'Seeking a Wow House',
|
'title': 'Best Ever Treehouses',
|
||||||
'description': 'Buyers retiring in Palm Springs, California, want a modern house with major wow factor. They\'re also looking for a pool and a large, open floorplan with tall windows looking out at the views.',
|
'description': "We're searching for the most over the top treehouses.",
|
||||||
'uploader': 'SCNI',
|
'uploader': 'ANV',
|
||||||
'upload_date': '20170207',
|
'upload_date': '20170922',
|
||||||
'timestamp': 1486450493,
|
'timestamp': 1506056400,
|
||||||
},
|
},
|
||||||
'skip': 'requires TV provider authentication',
|
'params': {
|
||||||
|
'skip_download': True,
|
||||||
|
},
|
||||||
|
'add_ie': [AnvatoIE.ie_key()],
|
||||||
|
}, {
|
||||||
|
'url': 'http://watch.diynetwork.com/show/DSAL/Salvage-Dawgs/2656646/Covington-Church/',
|
||||||
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://watch.diynetwork.com/player.HNT.html#2656646',
|
||||||
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://watch.geniuskitchen.com/player/3787617/Ample-Hills-Ice-Cream-Bike/',
|
||||||
|
'only_matching': True,
|
||||||
|
}]
|
||||||
|
|
||||||
|
_SNI_TABLE = {
|
||||||
|
'hgtv': 'hgtv',
|
||||||
|
'diynetwork': 'diy',
|
||||||
|
'foodnetwork': 'food',
|
||||||
|
'cookingchanneltv': 'cook',
|
||||||
|
'travelchannel': 'trav',
|
||||||
|
'geniuskitchen': 'genius',
|
||||||
}
|
}
|
||||||
|
_SNI_HOST = 'web.api.video.snidigital.com'
|
||||||
|
|
||||||
|
_AWS_REGION = 'us-east-1'
|
||||||
|
_AWS_IDENTITY_ID_JSON = json.dumps({
|
||||||
|
'IdentityId': '%s:7655847c-0ae7-4d9b-80d6-56c062927eb3' % _AWS_REGION
|
||||||
|
})
|
||||||
|
_AWS_USER_AGENT = 'aws-sdk-js/2.80.0 callback'
|
||||||
|
_AWS_API_KEY = 'E7wSQmq0qK6xPrF13WmzKiHo4BQ7tip4pQcSXVl1'
|
||||||
|
_AWS_SERVICE = 'execute-api'
|
||||||
|
_AWS_REQUEST = 'aws4_request'
|
||||||
|
_AWS_SIGNED_HEADERS = ';'.join([
|
||||||
|
'host', 'x-amz-date', 'x-amz-security-token', 'x-api-key'])
|
||||||
|
_AWS_CANONICAL_REQUEST_TEMPLATE = '''GET
|
||||||
|
%(uri)s
|
||||||
|
|
||||||
|
host:%(host)s
|
||||||
|
x-amz-date:%(date)s
|
||||||
|
x-amz-security-token:%(token)s
|
||||||
|
x-api-key:%(key)s
|
||||||
|
|
||||||
|
%(signed_headers)s
|
||||||
|
%(payload_hash)s'''
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
video_id = self._match_id(url)
|
mobj = re.match(self._VALID_URL, url)
|
||||||
webpage = self._download_webpage(url, video_id)
|
site_id, video_id = mobj.group('site', 'id')
|
||||||
channel = self._parse_json(self._search_regex(
|
|
||||||
r'"channels"\s*:\s*(\[.+\])',
|
|
||||||
webpage, 'channels'), video_id)[0]
|
|
||||||
video_data = next(v for v in channel['videos'] if v.get('nlvid') == video_id)
|
|
||||||
title = video_data['title']
|
|
||||||
release_url = video_data['releaseUrl']
|
|
||||||
if video_data.get('restricted'):
|
|
||||||
requestor_id = self._search_regex(
|
|
||||||
r'requestorId\s*=\s*"([^"]+)";', webpage, 'requestor id')
|
|
||||||
resource = self._get_mvpd_resource(
|
|
||||||
requestor_id, title, video_id,
|
|
||||||
video_data.get('ratings', [{}])[0].get('rating'))
|
|
||||||
auth = self._extract_mvpd_auth(
|
|
||||||
url, video_id, requestor_id, resource)
|
|
||||||
release_url = update_url_query(release_url, {'auth': auth})
|
|
||||||
|
|
||||||
return {
|
def aws_hash(s):
|
||||||
'_type': 'url_transparent',
|
return hashlib.sha256(s.encode('utf-8')).hexdigest()
|
||||||
'id': video_id,
|
|
||||||
'title': title,
|
token = self._download_json(
|
||||||
'url': smuggle_url(release_url, {'force_smil_url': True}),
|
'https://cognito-identity.us-east-1.amazonaws.com/', video_id,
|
||||||
'description': video_data.get('description'),
|
data=self._AWS_IDENTITY_ID_JSON.encode('utf-8'),
|
||||||
'thumbnail': video_data.get('thumbnailUrl'),
|
headers={
|
||||||
'series': video_data.get('showTitle'),
|
'Accept': '*/*',
|
||||||
'season_number': int_or_none(video_data.get('season')),
|
'Content-Type': 'application/x-amz-json-1.1',
|
||||||
'episode_number': int_or_none(video_data.get('episodeNumber')),
|
'Referer': url,
|
||||||
'ie_key': 'ThePlatform',
|
'X-Amz-Content-Sha256': aws_hash(self._AWS_IDENTITY_ID_JSON),
|
||||||
|
'X-Amz-Target': 'AWSCognitoIdentityService.GetOpenIdToken',
|
||||||
|
'X-Amz-User-Agent': self._AWS_USER_AGENT,
|
||||||
|
})['Token']
|
||||||
|
|
||||||
|
sts = self._download_xml(
|
||||||
|
'https://sts.amazonaws.com/', video_id, data=urlencode_postdata({
|
||||||
|
'Action': 'AssumeRoleWithWebIdentity',
|
||||||
|
'RoleArn': 'arn:aws:iam::710330595350:role/Cognito_WebAPIUnauth_Role',
|
||||||
|
'RoleSessionName': 'web-identity',
|
||||||
|
'Version': '2011-06-15',
|
||||||
|
'WebIdentityToken': token,
|
||||||
|
}), headers={
|
||||||
|
'Referer': url,
|
||||||
|
'X-Amz-User-Agent': self._AWS_USER_AGENT,
|
||||||
|
'Content-Type': 'application/x-www-form-urlencoded; charset=utf-8',
|
||||||
|
})
|
||||||
|
|
||||||
|
def get(key):
|
||||||
|
return xpath_text(
|
||||||
|
sts, './/{https://sts.amazonaws.com/doc/2011-06-15/}%s' % key,
|
||||||
|
fatal=True)
|
||||||
|
|
||||||
|
access_key_id = get('AccessKeyId')
|
||||||
|
secret_access_key = get('SecretAccessKey')
|
||||||
|
session_token = get('SessionToken')
|
||||||
|
|
||||||
|
# Task 1: http://docs.aws.amazon.com/general/latest/gr/sigv4-create-canonical-request.html
|
||||||
|
uri = '/1/web/brands/%s/episodes/scrid/%s' % (self._SNI_TABLE[site_id], video_id)
|
||||||
|
datetime_now = datetime.datetime.utcnow().strftime('%Y%m%dT%H%M%SZ')
|
||||||
|
date = datetime_now[:8]
|
||||||
|
canonical_string = self._AWS_CANONICAL_REQUEST_TEMPLATE % {
|
||||||
|
'uri': uri,
|
||||||
|
'host': self._SNI_HOST,
|
||||||
|
'date': datetime_now,
|
||||||
|
'token': session_token,
|
||||||
|
'key': self._AWS_API_KEY,
|
||||||
|
'signed_headers': self._AWS_SIGNED_HEADERS,
|
||||||
|
'payload_hash': aws_hash(''),
|
||||||
}
|
}
|
||||||
|
|
||||||
|
# Task 2: http://docs.aws.amazon.com/general/latest/gr/sigv4-create-string-to-sign.html
|
||||||
|
credential_string = '/'.join([date, self._AWS_REGION, self._AWS_SERVICE, self._AWS_REQUEST])
|
||||||
|
string_to_sign = '\n'.join([
|
||||||
|
'AWS4-HMAC-SHA256', datetime_now, credential_string,
|
||||||
|
aws_hash(canonical_string)])
|
||||||
|
|
||||||
|
# Task 3: http://docs.aws.amazon.com/general/latest/gr/sigv4-calculate-signature.html
|
||||||
|
def aws_hmac(key, msg):
|
||||||
|
return hmac.new(key, msg.encode('utf-8'), hashlib.sha256)
|
||||||
|
|
||||||
|
def aws_hmac_digest(key, msg):
|
||||||
|
return aws_hmac(key, msg).digest()
|
||||||
|
|
||||||
|
def aws_hmac_hexdigest(key, msg):
|
||||||
|
return aws_hmac(key, msg).hexdigest()
|
||||||
|
|
||||||
|
k_secret = 'AWS4' + secret_access_key
|
||||||
|
k_date = aws_hmac_digest(k_secret.encode('utf-8'), date)
|
||||||
|
k_region = aws_hmac_digest(k_date, self._AWS_REGION)
|
||||||
|
k_service = aws_hmac_digest(k_region, self._AWS_SERVICE)
|
||||||
|
k_signing = aws_hmac_digest(k_service, self._AWS_REQUEST)
|
||||||
|
|
||||||
|
signature = aws_hmac_hexdigest(k_signing, string_to_sign)
|
||||||
|
|
||||||
|
auth_header = ', '.join([
|
||||||
|
'AWS4-HMAC-SHA256 Credential=%s' % '/'.join(
|
||||||
|
[access_key_id, date, self._AWS_REGION, self._AWS_SERVICE, self._AWS_REQUEST]),
|
||||||
|
'SignedHeaders=%s' % self._AWS_SIGNED_HEADERS,
|
||||||
|
'Signature=%s' % signature,
|
||||||
|
])
|
||||||
|
|
||||||
|
mcp_id = self._download_json(
|
||||||
|
'https://%s%s' % (self._SNI_HOST, uri), video_id, headers={
|
||||||
|
'Accept': '*/*',
|
||||||
|
'Referer': url,
|
||||||
|
'Authorization': auth_header,
|
||||||
|
'X-Amz-Date': datetime_now,
|
||||||
|
'X-Amz-Security-Token': session_token,
|
||||||
|
'X-Api-Key': self._AWS_API_KEY,
|
||||||
|
})['results'][0]['mcpId']
|
||||||
|
|
||||||
|
return self.url_result(
|
||||||
|
smuggle_url(
|
||||||
|
'anvato:anvato_scripps_app_web_prod_0837996dbe373629133857ae9eb72e740424d80a:%s' % mcp_id,
|
||||||
|
{'geo_countries': ['US']}),
|
||||||
|
AnvatoIE.ie_key(), video_id=mcp_id)
|
||||||
|
43
youtube_dl/extractor/servus.py
Normal file
43
youtube_dl/extractor/servus.py
Normal file
@ -0,0 +1,43 @@
|
|||||||
|
# coding: utf-8
|
||||||
|
from __future__ import unicode_literals
|
||||||
|
|
||||||
|
from .common import InfoExtractor
|
||||||
|
|
||||||
|
|
||||||
|
class ServusIE(InfoExtractor):
|
||||||
|
_VALID_URL = r'https?://(?:www\.)?servus\.com/(?:at|de)/p/[^/]+/(?P<id>AA-\w+|\d+-\d+)'
|
||||||
|
_TESTS = [{
|
||||||
|
'url': 'https://www.servus.com/de/p/Die-Gr%C3%BCnen-aus-Sicht-des-Volkes/AA-1T6VBU5PW1W12/',
|
||||||
|
'md5': '046dee641cda1c4cabe13baef3be2c1c',
|
||||||
|
'info_dict': {
|
||||||
|
'id': 'AA-1T6VBU5PW1W12',
|
||||||
|
'ext': 'mp4',
|
||||||
|
'title': 'Die Grünen aus Volkssicht',
|
||||||
|
'description': 'md5:052b5da1cb2cd7d562ef1f19be5a5cba',
|
||||||
|
'thumbnail': r're:^https?://.*\.jpg$',
|
||||||
|
}
|
||||||
|
}, {
|
||||||
|
'url': 'https://www.servus.com/at/p/Wie-das-Leben-beginnt/1309984137314-381415152/',
|
||||||
|
'only_matching': True,
|
||||||
|
}]
|
||||||
|
|
||||||
|
def _real_extract(self, url):
|
||||||
|
video_id = self._match_id(url)
|
||||||
|
webpage = self._download_webpage(url, video_id)
|
||||||
|
|
||||||
|
title = self._og_search_title(webpage)
|
||||||
|
description = self._og_search_description(webpage)
|
||||||
|
thumbnail = self._og_search_thumbnail(webpage)
|
||||||
|
|
||||||
|
formats = self._extract_m3u8_formats(
|
||||||
|
'https://stv.rbmbtnx.net/api/v1/manifests/%s.m3u8' % video_id,
|
||||||
|
video_id, 'mp4', entry_protocol='m3u8_native', m3u8_id='hls')
|
||||||
|
self._sort_formats(formats)
|
||||||
|
|
||||||
|
return {
|
||||||
|
'id': video_id,
|
||||||
|
'title': title,
|
||||||
|
'description': description,
|
||||||
|
'thumbnail': thumbnail,
|
||||||
|
'formats': formats,
|
||||||
|
}
|
@ -18,46 +18,32 @@ from ..utils import (
|
|||||||
|
|
||||||
class ShahidIE(InfoExtractor):
|
class ShahidIE(InfoExtractor):
|
||||||
_NETRC_MACHINE = 'shahid'
|
_NETRC_MACHINE = 'shahid'
|
||||||
_VALID_URL = r'https?://shahid\.mbc\.net/ar/(?P<type>episode|movie)/(?P<id>\d+)'
|
_VALID_URL = r'https?://shahid\.mbc\.net/ar/(?:serie|show|movie)s/[^/]+/(?P<type>episode|clip|movie)-(?P<id>\d+)'
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
'url': 'https://shahid.mbc.net/ar/episode/90574/%D8%A7%D9%84%D9%85%D9%84%D9%83-%D8%B9%D8%A8%D8%AF%D8%A7%D9%84%D9%84%D9%87-%D8%A7%D9%84%D8%A5%D9%86%D8%B3%D8%A7%D9%86-%D8%A7%D9%84%D9%85%D9%88%D8%B3%D9%85-1-%D9%83%D9%84%D9%8A%D8%A8-3.html',
|
'url': 'https://shahid.mbc.net/ar/shows/%D9%85%D8%AC%D9%84%D8%B3-%D8%A7%D9%84%D8%B4%D8%A8%D8%A7%D8%A8-%D8%A7%D9%84%D9%85%D9%88%D8%B3%D9%85-1-%D9%83%D9%84%D9%8A%D8%A8-1/clip-275286',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '90574',
|
'id': '275286',
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
'title': 'الملك عبدالله الإنسان الموسم 1 كليب 3',
|
'title': 'مجلس الشباب الموسم 1 كليب 1',
|
||||||
'description': 'الفيلم الوثائقي - الملك عبد الله الإنسان',
|
'timestamp': 1506988800,
|
||||||
'duration': 2972,
|
'upload_date': '20171003',
|
||||||
'timestamp': 1422057420,
|
|
||||||
'upload_date': '20150123',
|
|
||||||
},
|
},
|
||||||
'params': {
|
'params': {
|
||||||
# m3u8 download
|
# m3u8 download
|
||||||
'skip_download': True,
|
'skip_download': True,
|
||||||
}
|
}
|
||||||
}, {
|
}, {
|
||||||
'url': 'https://shahid.mbc.net/ar/movie/151746/%D8%A7%D9%84%D9%82%D9%86%D8%A7%D8%B5%D8%A9.html',
|
'url': 'https://shahid.mbc.net/ar/movies/%D8%A7%D9%84%D9%82%D9%86%D8%A7%D8%B5%D8%A9/movie-151746',
|
||||||
'only_matching': True
|
'only_matching': True
|
||||||
}, {
|
}, {
|
||||||
# shahid plus subscriber only
|
# shahid plus subscriber only
|
||||||
'url': 'https://shahid.mbc.net/ar/episode/90511/%D9%85%D8%B1%D8%A7%D9%8A%D8%A7-2011-%D8%A7%D9%84%D9%85%D9%88%D8%B3%D9%85-1-%D8%A7%D9%84%D8%AD%D9%84%D9%82%D8%A9-1.html',
|
'url': 'https://shahid.mbc.net/ar/series/%D9%85%D8%B1%D8%A7%D9%8A%D8%A7-2011-%D8%A7%D9%84%D9%85%D9%88%D8%B3%D9%85-1-%D8%A7%D9%84%D8%AD%D9%84%D9%82%D8%A9-1/episode-90511',
|
||||||
'only_matching': True
|
'only_matching': True
|
||||||
}]
|
}]
|
||||||
|
|
||||||
def _real_initialize(self):
|
def _api2_request(self, *args, **kwargs):
|
||||||
email, password = self._get_login_info()
|
|
||||||
if email is None:
|
|
||||||
return
|
|
||||||
|
|
||||||
try:
|
try:
|
||||||
user_data = self._download_json(
|
return self._download_json(*args, **kwargs)
|
||||||
'https://shahid.mbc.net/wd/service/users/login',
|
|
||||||
None, 'Logging in', data=json.dumps({
|
|
||||||
'email': email,
|
|
||||||
'password': password,
|
|
||||||
'basic': 'false',
|
|
||||||
}).encode('utf-8'), headers={
|
|
||||||
'Content-Type': 'application/json; charset=UTF-8',
|
|
||||||
})['user']
|
|
||||||
except ExtractorError as e:
|
except ExtractorError as e:
|
||||||
if isinstance(e.cause, compat_HTTPError):
|
if isinstance(e.cause, compat_HTTPError):
|
||||||
fail_data = self._parse_json(
|
fail_data = self._parse_json(
|
||||||
@ -69,6 +55,21 @@ class ShahidIE(InfoExtractor):
|
|||||||
raise ExtractorError(faults_message, expected=True)
|
raise ExtractorError(faults_message, expected=True)
|
||||||
raise
|
raise
|
||||||
|
|
||||||
|
def _real_initialize(self):
|
||||||
|
email, password = self._get_login_info()
|
||||||
|
if email is None:
|
||||||
|
return
|
||||||
|
|
||||||
|
user_data = self._api2_request(
|
||||||
|
'https://shahid.mbc.net/wd/service/users/login',
|
||||||
|
None, 'Logging in', data=json.dumps({
|
||||||
|
'email': email,
|
||||||
|
'password': password,
|
||||||
|
'basic': 'false',
|
||||||
|
}).encode('utf-8'), headers={
|
||||||
|
'Content-Type': 'application/json; charset=UTF-8',
|
||||||
|
})['user']
|
||||||
|
|
||||||
self._download_webpage(
|
self._download_webpage(
|
||||||
'https://shahid.mbc.net/populateContext',
|
'https://shahid.mbc.net/populateContext',
|
||||||
None, 'Populate Context', data=urlencode_postdata({
|
None, 'Populate Context', data=urlencode_postdata({
|
||||||
@ -93,15 +94,17 @@ class ShahidIE(InfoExtractor):
|
|||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
page_type, video_id = re.match(self._VALID_URL, url).groups()
|
page_type, video_id = re.match(self._VALID_URL, url).groups()
|
||||||
|
if page_type == 'clip':
|
||||||
|
page_type = 'episode'
|
||||||
|
|
||||||
player = self._get_api_data(self._download_json(
|
playout = self._api2_request(
|
||||||
'https://shahid.mbc.net/arContent/getPlayerContent-param-.id-%s.type-player.html' % video_id,
|
'https://api2.shahid.net/proxy/v2/playout/url/' + video_id,
|
||||||
video_id, 'Downloading player JSON'))
|
video_id, 'Downloading player JSON')['playout']
|
||||||
|
|
||||||
if player.get('drm'):
|
if playout.get('drm'):
|
||||||
raise ExtractorError('This video is DRM protected.', expected=True)
|
raise ExtractorError('This video is DRM protected.', expected=True)
|
||||||
|
|
||||||
formats = self._extract_m3u8_formats(player['url'], video_id, 'mp4')
|
formats = self._extract_m3u8_formats(playout['url'], video_id, 'mp4')
|
||||||
self._sort_formats(formats)
|
self._sort_formats(formats)
|
||||||
|
|
||||||
video = self._get_api_data(self._download_json(
|
video = self._get_api_data(self._download_json(
|
||||||
|
34
youtube_dl/extractor/slideslive.py
Normal file
34
youtube_dl/extractor/slideslive.py
Normal file
@ -0,0 +1,34 @@
|
|||||||
|
# coding: utf-8
|
||||||
|
from __future__ import unicode_literals
|
||||||
|
|
||||||
|
from .common import InfoExtractor
|
||||||
|
from ..utils import ExtractorError
|
||||||
|
|
||||||
|
|
||||||
|
class SlidesLiveIE(InfoExtractor):
|
||||||
|
_VALID_URL = r'https?://slideslive\.com/(?P<id>[0-9]+)'
|
||||||
|
_TESTS = [{
|
||||||
|
'url': 'https://slideslive.com/38902413/gcc-ia16-backend',
|
||||||
|
'md5': 'b29fcd6c6952d0c79c5079b0e7a07e6f',
|
||||||
|
'info_dict': {
|
||||||
|
'id': 'LMtgR8ba0b0',
|
||||||
|
'ext': 'mp4',
|
||||||
|
'title': '38902413: external video',
|
||||||
|
'description': '3890241320170925-9-1yd6ech.mp4',
|
||||||
|
'uploader': 'SlidesLive Administrator',
|
||||||
|
'uploader_id': 'UC62SdArr41t_-_fX40QCLRw',
|
||||||
|
'upload_date': '20170925',
|
||||||
|
}
|
||||||
|
}]
|
||||||
|
|
||||||
|
def _real_extract(self, url):
|
||||||
|
video_id = self._match_id(url)
|
||||||
|
video_data = self._download_json(
|
||||||
|
url, video_id, headers={'Accept': 'application/json'})
|
||||||
|
service_name = video_data['video_service_name']
|
||||||
|
if service_name == 'YOUTUBE':
|
||||||
|
yt_video_id = video_data['video_service_id']
|
||||||
|
return self.url_result(yt_video_id, 'Youtube', video_id=yt_video_id)
|
||||||
|
else:
|
||||||
|
raise ExtractorError(
|
||||||
|
'Unsupported service name: {0}'.format(service_name), expected=True)
|
@ -138,7 +138,7 @@ class SoundcloudIE(InfoExtractor):
|
|||||||
},
|
},
|
||||||
]
|
]
|
||||||
|
|
||||||
_CLIENT_ID = 'JlZIsxg2hY5WnBgtn3jfS0UYCl0K8DOg'
|
_CLIENT_ID = 'c6CU49JDMapyrQo06UxU9xouB9ZVzqCn'
|
||||||
_IPHONE_CLIENT_ID = '376f225bf427445fc4bfb6b99b72e0bf'
|
_IPHONE_CLIENT_ID = '376f225bf427445fc4bfb6b99b72e0bf'
|
||||||
|
|
||||||
@staticmethod
|
@staticmethod
|
||||||
|
@ -44,6 +44,7 @@ class SpikeIE(MTVServicesInfoExtractor):
|
|||||||
_FEED_URL = 'http://www.spike.com/feeds/mrss/'
|
_FEED_URL = 'http://www.spike.com/feeds/mrss/'
|
||||||
_MOBILE_TEMPLATE = 'http://m.spike.com/videos/video.rbml?id=%s'
|
_MOBILE_TEMPLATE = 'http://m.spike.com/videos/video.rbml?id=%s'
|
||||||
_CUSTOM_URL_REGEX = re.compile(r'spikenetworkapp://([^/]+/[-a-fA-F0-9]+)')
|
_CUSTOM_URL_REGEX = re.compile(r'spikenetworkapp://([^/]+/[-a-fA-F0-9]+)')
|
||||||
|
_GEO_COUNTRIES = ['US']
|
||||||
|
|
||||||
def _extract_mgid(self, webpage):
|
def _extract_mgid(self, webpage):
|
||||||
mgid = super(SpikeIE, self)._extract_mgid(webpage)
|
mgid = super(SpikeIE, self)._extract_mgid(webpage)
|
||||||
|
@ -66,7 +66,7 @@ class StanfordOpenClassroomIE(InfoExtractor):
|
|||||||
r'(?s)<description>([^<]+)</description>',
|
r'(?s)<description>([^<]+)</description>',
|
||||||
coursepage, 'description', fatal=False)
|
coursepage, 'description', fatal=False)
|
||||||
|
|
||||||
links = orderedSet(re.findall(r'<a href="(VideoPage.php\?[^"]+)">', coursepage))
|
links = orderedSet(re.findall(r'<a href="(VideoPage\.php\?[^"]+)">', coursepage))
|
||||||
info['entries'] = [self.url_result(
|
info['entries'] = [self.url_result(
|
||||||
'http://openclassroom.stanford.edu/MainFolder/%s' % unescapeHTML(l)
|
'http://openclassroom.stanford.edu/MainFolder/%s' % unescapeHTML(l)
|
||||||
) for l in links]
|
) for l in links]
|
||||||
@ -84,7 +84,7 @@ class StanfordOpenClassroomIE(InfoExtractor):
|
|||||||
rootpage = self._download_webpage(rootURL, info['id'],
|
rootpage = self._download_webpage(rootURL, info['id'],
|
||||||
errnote='Unable to download course info page')
|
errnote='Unable to download course info page')
|
||||||
|
|
||||||
links = orderedSet(re.findall(r'<a href="(CoursePage.php\?[^"]+)">', rootpage))
|
links = orderedSet(re.findall(r'<a href="(CoursePage\.php\?[^"]+)">', rootpage))
|
||||||
info['entries'] = [self.url_result(
|
info['entries'] = [self.url_result(
|
||||||
'http://openclassroom.stanford.edu/MainFolder/%s' % unescapeHTML(l)
|
'http://openclassroom.stanford.edu/MainFolder/%s' % unescapeHTML(l)
|
||||||
) for l in links]
|
) for l in links]
|
||||||
|
@ -4,8 +4,10 @@ import re
|
|||||||
|
|
||||||
from .common import InfoExtractor
|
from .common import InfoExtractor
|
||||||
from ..utils import (
|
from ..utils import (
|
||||||
|
extract_attributes,
|
||||||
ExtractorError,
|
ExtractorError,
|
||||||
unescapeHTML,
|
get_element_by_class,
|
||||||
|
js_to_json,
|
||||||
)
|
)
|
||||||
|
|
||||||
|
|
||||||
@ -25,35 +27,39 @@ class SteamIE(InfoExtractor):
|
|||||||
'url': 'http://store.steampowered.com/video/105600/',
|
'url': 'http://store.steampowered.com/video/105600/',
|
||||||
'playlist': [
|
'playlist': [
|
||||||
{
|
{
|
||||||
'md5': 'f870007cee7065d7c76b88f0a45ecc07',
|
'md5': '6a294ee0c4b1f47f5bb76a65e31e3592',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '81300',
|
'id': '2040428',
|
||||||
'ext': 'flv',
|
'ext': 'mp4',
|
||||||
'title': 'Terraria 1.1 Trailer',
|
'title': 'Terraria 1.3 Trailer',
|
||||||
'playlist_index': 1,
|
'playlist_index': 1,
|
||||||
}
|
}
|
||||||
},
|
},
|
||||||
{
|
{
|
||||||
'md5': '61aaf31a5c5c3041afb58fb83cbb5751',
|
'md5': '911672b20064ca3263fa89650ba5a7aa',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '80859',
|
'id': '2029566',
|
||||||
'ext': 'flv',
|
'ext': 'mp4',
|
||||||
'title': 'Terraria Trailer',
|
'title': 'Terraria 1.2 Trailer',
|
||||||
'playlist_index': 2,
|
'playlist_index': 2,
|
||||||
}
|
}
|
||||||
}
|
}
|
||||||
],
|
],
|
||||||
|
'info_dict': {
|
||||||
|
'id': '105600',
|
||||||
|
'title': 'Terraria',
|
||||||
|
},
|
||||||
'params': {
|
'params': {
|
||||||
'playlistend': 2,
|
'playlistend': 2,
|
||||||
}
|
}
|
||||||
}, {
|
}, {
|
||||||
'url': 'http://steamcommunity.com/sharedfiles/filedetails/?id=242472205',
|
'url': 'http://steamcommunity.com/sharedfiles/filedetails/?id=242472205',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': 'WB5DvDOOvAY',
|
'id': 'X8kpJBlzD2E',
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
'upload_date': '20140329',
|
'upload_date': '20140617',
|
||||||
'title': 'FRONTIERS - Final Greenlight Trailer',
|
'title': 'FRONTIERS - Trapping',
|
||||||
'description': 'md5:dc96a773669d0ca1b36c13c1f30250d9',
|
'description': 'md5:bf6f7f773def614054089e5769c12a6e',
|
||||||
'uploader': 'AAD Productions',
|
'uploader': 'AAD Productions',
|
||||||
'uploader_id': 'AtomicAgeDogGames',
|
'uploader_id': 'AtomicAgeDogGames',
|
||||||
}
|
}
|
||||||
@ -76,48 +82,65 @@ class SteamIE(InfoExtractor):
|
|||||||
self.report_age_confirmation()
|
self.report_age_confirmation()
|
||||||
webpage = self._download_webpage(videourl, playlist_id)
|
webpage = self._download_webpage(videourl, playlist_id)
|
||||||
|
|
||||||
|
flash_vars = self._parse_json(self._search_regex(
|
||||||
|
r'(?s)rgMovieFlashvars\s*=\s*({.+?});', webpage,
|
||||||
|
'flash vars'), playlist_id, js_to_json)
|
||||||
|
|
||||||
|
playlist_title = None
|
||||||
|
entries = []
|
||||||
if fileID:
|
if fileID:
|
||||||
playlist_title = self._html_search_regex(
|
playlist_title = get_element_by_class('workshopItemTitle', webpage)
|
||||||
r'<div class="workshopItemTitle">(.+)</div>', webpage, 'title')
|
for movie in flash_vars.values():
|
||||||
mweb = re.finditer(r'''(?x)
|
if not movie:
|
||||||
'movie_(?P<videoID>[0-9]+)':\s*\{\s*
|
continue
|
||||||
YOUTUBE_VIDEO_ID:\s*"(?P<youtube_id>[^"]+)",
|
youtube_id = movie.get('YOUTUBE_VIDEO_ID')
|
||||||
''', webpage)
|
if not youtube_id:
|
||||||
videos = [{
|
continue
|
||||||
|
entries.append({
|
||||||
'_type': 'url',
|
'_type': 'url',
|
||||||
'url': vid.group('youtube_id'),
|
'url': youtube_id,
|
||||||
'ie_key': 'Youtube',
|
'ie_key': 'Youtube',
|
||||||
} for vid in mweb]
|
|
||||||
else:
|
|
||||||
playlist_title = self._html_search_regex(
|
|
||||||
r'<h2 class="pageheader">(.*?)</h2>', webpage, 'game title')
|
|
||||||
|
|
||||||
mweb = re.finditer(r'''(?x)
|
|
||||||
'movie_(?P<videoID>[0-9]+)':\s*\{\s*
|
|
||||||
FILENAME:\s*"(?P<videoURL>[\w:/\.\?=]+)"
|
|
||||||
(,\s*MOVIE_NAME:\s*\"(?P<videoName>[\w:/\.\?=\+-]+)\")?\s*\},
|
|
||||||
''', webpage)
|
|
||||||
titles = re.finditer(
|
|
||||||
r'<span class="title">(?P<videoName>.+?)</span>', webpage)
|
|
||||||
thumbs = re.finditer(
|
|
||||||
r'<img class="movie_thumb" src="(?P<thumbnail>.+?)">', webpage)
|
|
||||||
videos = []
|
|
||||||
|
|
||||||
for vid, vtitle, thumb in zip(mweb, titles, thumbs):
|
|
||||||
video_id = vid.group('videoID')
|
|
||||||
title = vtitle.group('videoName')
|
|
||||||
video_url = vid.group('videoURL')
|
|
||||||
video_thumb = thumb.group('thumbnail')
|
|
||||||
if not video_url:
|
|
||||||
raise ExtractorError('Cannot find video url for %s' % video_id)
|
|
||||||
videos.append({
|
|
||||||
'id': video_id,
|
|
||||||
'url': video_url,
|
|
||||||
'ext': 'flv',
|
|
||||||
'title': unescapeHTML(title),
|
|
||||||
'thumbnail': video_thumb
|
|
||||||
})
|
})
|
||||||
if not videos:
|
else:
|
||||||
|
playlist_title = get_element_by_class('apphub_AppName', webpage)
|
||||||
|
for movie_id, movie in flash_vars.items():
|
||||||
|
if not movie:
|
||||||
|
continue
|
||||||
|
video_id = self._search_regex(r'movie_(\d+)', movie_id, 'video id', fatal=False)
|
||||||
|
title = movie.get('MOVIE_NAME')
|
||||||
|
if not title or not video_id:
|
||||||
|
continue
|
||||||
|
entry = {
|
||||||
|
'id': video_id,
|
||||||
|
'title': title.replace('+', ' '),
|
||||||
|
}
|
||||||
|
formats = []
|
||||||
|
flv_url = movie.get('FILENAME')
|
||||||
|
if flv_url:
|
||||||
|
formats.append({
|
||||||
|
'format_id': 'flv',
|
||||||
|
'url': flv_url,
|
||||||
|
})
|
||||||
|
highlight_element = self._search_regex(
|
||||||
|
r'(<div[^>]+id="highlight_movie_%s"[^>]+>)' % video_id,
|
||||||
|
webpage, 'highlight element', fatal=False)
|
||||||
|
if highlight_element:
|
||||||
|
highlight_attribs = extract_attributes(highlight_element)
|
||||||
|
if highlight_attribs:
|
||||||
|
entry['thumbnail'] = highlight_attribs.get('data-poster')
|
||||||
|
for quality in ('', '-hd'):
|
||||||
|
for ext in ('webm', 'mp4'):
|
||||||
|
video_url = highlight_attribs.get('data-%s%s-source' % (ext, quality))
|
||||||
|
if video_url:
|
||||||
|
formats.append({
|
||||||
|
'format_id': ext + quality,
|
||||||
|
'url': video_url,
|
||||||
|
})
|
||||||
|
if not formats:
|
||||||
|
continue
|
||||||
|
entry['formats'] = formats
|
||||||
|
entries.append(entry)
|
||||||
|
if not entries:
|
||||||
raise ExtractorError('Could not find any videos')
|
raise ExtractorError('Could not find any videos')
|
||||||
|
|
||||||
return self.playlist_result(videos, playlist_id, playlist_title)
|
return self.playlist_result(entries, playlist_id, playlist_title)
|
||||||
|
@ -216,7 +216,7 @@ class ThePlatformIE(ThePlatformBaseIE, AdobePassIE):
|
|||||||
def hex_to_bytes(hex):
|
def hex_to_bytes(hex):
|
||||||
return binascii.a2b_hex(hex.encode('ascii'))
|
return binascii.a2b_hex(hex.encode('ascii'))
|
||||||
|
|
||||||
relative_path = re.match(r'https?://link.theplatform.com/s/([^?]+)', url).group(1)
|
relative_path = re.match(r'https?://link\.theplatform\.com/s/([^?]+)', url).group(1)
|
||||||
clear_text = hex_to_bytes(flags + expiration_date + str_to_hex(relative_path))
|
clear_text = hex_to_bytes(flags + expiration_date + str_to_hex(relative_path))
|
||||||
checksum = hmac.new(sig_key.encode('ascii'), clear_text, hashlib.sha1).hexdigest()
|
checksum = hmac.new(sig_key.encode('ascii'), clear_text, hashlib.sha1).hexdigest()
|
||||||
sig = flags + expiration_date + checksum + str_to_hex(sig_secret)
|
sig = flags + expiration_date + checksum + str_to_hex(sig_secret)
|
||||||
|
@ -57,10 +57,10 @@ class ThisAVIE(InfoExtractor):
|
|||||||
info_dict = self._extract_jwplayer_data(
|
info_dict = self._extract_jwplayer_data(
|
||||||
webpage, video_id, require_title=False)
|
webpage, video_id, require_title=False)
|
||||||
uploader = self._html_search_regex(
|
uploader = self._html_search_regex(
|
||||||
r': <a href="http://www.thisav.com/user/[0-9]+/(?:[^"]+)">([^<]+)</a>',
|
r': <a href="http://www\.thisav\.com/user/[0-9]+/(?:[^"]+)">([^<]+)</a>',
|
||||||
webpage, 'uploader name', fatal=False)
|
webpage, 'uploader name', fatal=False)
|
||||||
uploader_id = self._html_search_regex(
|
uploader_id = self._html_search_regex(
|
||||||
r': <a href="http://www.thisav.com/user/[0-9]+/([^"]+)">(?:[^<]+)</a>',
|
r': <a href="http://www\.thisav\.com/user/[0-9]+/([^"]+)">(?:[^<]+)</a>',
|
||||||
webpage, 'uploader id', fatal=False)
|
webpage, 'uploader id', fatal=False)
|
||||||
|
|
||||||
info_dict.update({
|
info_dict.update({
|
||||||
|
@ -13,11 +13,11 @@ from ..utils import (
|
|||||||
|
|
||||||
|
|
||||||
class TubiTvIE(InfoExtractor):
|
class TubiTvIE(InfoExtractor):
|
||||||
_VALID_URL = r'https?://(?:www\.)?tubitv\.com/video/(?P<id>[0-9]+)'
|
_VALID_URL = r'https?://(?:www\.)?tubitv\.com/(?:video|movies|tv-shows)/(?P<id>[0-9]+)'
|
||||||
_LOGIN_URL = 'http://tubitv.com/login'
|
_LOGIN_URL = 'http://tubitv.com/login'
|
||||||
_NETRC_MACHINE = 'tubitv'
|
_NETRC_MACHINE = 'tubitv'
|
||||||
_GEO_COUNTRIES = ['US']
|
_GEO_COUNTRIES = ['US']
|
||||||
_TEST = {
|
_TESTS = [{
|
||||||
'url': 'http://tubitv.com/video/283829/the_comedian_at_the_friday',
|
'url': 'http://tubitv.com/video/283829/the_comedian_at_the_friday',
|
||||||
'md5': '43ac06be9326f41912dc64ccf7a80320',
|
'md5': '43ac06be9326f41912dc64ccf7a80320',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
@ -27,7 +27,13 @@ class TubiTvIE(InfoExtractor):
|
|||||||
'description': 'A stand up comedian is forced to look at the decisions in his life while on a one week trip to the west coast.',
|
'description': 'A stand up comedian is forced to look at the decisions in his life while on a one week trip to the west coast.',
|
||||||
'uploader_id': 'bc168bee0d18dd1cb3b86c68706ab434',
|
'uploader_id': 'bc168bee0d18dd1cb3b86c68706ab434',
|
||||||
},
|
},
|
||||||
}
|
}, {
|
||||||
|
'url': 'http://tubitv.com/tv-shows/321886/s01_e01_on_nom_stories',
|
||||||
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://tubitv.com/movies/383676/tracker',
|
||||||
|
'only_matching': True,
|
||||||
|
}]
|
||||||
|
|
||||||
def _login(self):
|
def _login(self):
|
||||||
(username, password) = self._get_login_info()
|
(username, password) = self._get_login_info()
|
||||||
|
@ -3,52 +3,50 @@ from __future__ import unicode_literals
|
|||||||
|
|
||||||
from .common import InfoExtractor
|
from .common import InfoExtractor
|
||||||
from ..utils import (
|
from ..utils import (
|
||||||
int_or_none,
|
float_or_none,
|
||||||
parse_iso8601,
|
|
||||||
smuggle_url,
|
smuggle_url,
|
||||||
)
|
)
|
||||||
|
|
||||||
|
|
||||||
class TVAIE(InfoExtractor):
|
class TVAIE(InfoExtractor):
|
||||||
_VALID_URL = r'https?://videos\.tva\.ca/episode/(?P<id>\d+)'
|
_VALID_URL = r'https?://videos\.tva\.ca/details/_(?P<id>\d+)'
|
||||||
_TEST = {
|
_TEST = {
|
||||||
'url': 'http://videos.tva.ca/episode/85538',
|
'url': 'https://videos.tva.ca/details/_5596811470001',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '85538',
|
'id': '5596811470001',
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
'title': 'Épisode du 25 janvier 2017',
|
'title': 'Un extrait de l\'épisode du dimanche 8 octobre 2017 !',
|
||||||
'description': 'md5:e9e7fb5532ab37984d2dc87229cadf98',
|
'uploader_id': '5481942443001',
|
||||||
'upload_date': '20170126',
|
'upload_date': '20171003',
|
||||||
'timestamp': 1485442329,
|
'timestamp': 1507064617,
|
||||||
},
|
},
|
||||||
'params': {
|
'params': {
|
||||||
# m3u8 download
|
# m3u8 download
|
||||||
'skip_download': True,
|
'skip_download': True,
|
||||||
}
|
}
|
||||||
}
|
}
|
||||||
|
BRIGHTCOVE_URL_TEMPLATE = 'http://players.brightcove.net/5481942443001/default_default/index.html?videoId=%s'
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
video_id = self._match_id(url)
|
video_id = self._match_id(url)
|
||||||
video_data = self._download_json(
|
video_data = self._download_json(
|
||||||
"https://d18jmrhziuoi7p.cloudfront.net/isl/api/v1/dataservice/Items('%s')" % video_id,
|
'https://videos.tva.ca/proxy/item/_' + video_id, video_id, headers={
|
||||||
video_id, query={
|
'Accept': 'application/json',
|
||||||
'$expand': 'Metadata,CustomId',
|
|
||||||
'$select': 'Metadata,Id,Title,ShortDescription,LongDescription,CreatedDate,CustomId,AverageUserRating,Categories,ShowName',
|
|
||||||
'$format': 'json',
|
|
||||||
})
|
})
|
||||||
metadata = video_data.get('Metadata', {})
|
|
||||||
|
def get_attribute(key):
|
||||||
|
for attribute in video_data.get('attributes', []):
|
||||||
|
if attribute.get('key') == key:
|
||||||
|
return attribute.get('value')
|
||||||
|
return None
|
||||||
|
|
||||||
return {
|
return {
|
||||||
'_type': 'url_transparent',
|
'_type': 'url_transparent',
|
||||||
'id': video_id,
|
'id': video_id,
|
||||||
'title': video_data['Title'],
|
'title': get_attribute('title'),
|
||||||
'url': smuggle_url('ooyala:' + video_data['CustomId'], {'supportedformats': 'm3u8,hds'}),
|
'url': smuggle_url(self.BRIGHTCOVE_URL_TEMPLATE % video_id, {'geo_countries': ['CA']}),
|
||||||
'description': video_data.get('LongDescription') or video_data.get('ShortDescription'),
|
'description': get_attribute('description'),
|
||||||
'series': video_data.get('ShowName'),
|
'thumbnail': get_attribute('image-background') or get_attribute('image-landscape'),
|
||||||
'episode': metadata.get('EpisodeTitle'),
|
'duration': float_or_none(get_attribute('video-duration'), 1000),
|
||||||
'episode_number': int_or_none(metadata.get('EpisodeNumber')),
|
'ie_key': 'BrightcoveNew',
|
||||||
'categories': video_data.get('Categories'),
|
|
||||||
'average_rating': video_data.get('AverageUserRating'),
|
|
||||||
'timestamp': parse_iso8601(video_data.get('CreatedDate')),
|
|
||||||
'ie_key': 'Ooyala',
|
|
||||||
}
|
}
|
||||||
|
@ -609,7 +609,7 @@ class TwitchClipsIE(InfoExtractor):
|
|||||||
r'(?s)clipInfo\s*=\s*({.+?});', webpage, 'clip info'),
|
r'(?s)clipInfo\s*=\s*({.+?});', webpage, 'clip info'),
|
||||||
video_id, transform_source=js_to_json)
|
video_id, transform_source=js_to_json)
|
||||||
|
|
||||||
title = clip.get('channel_title') or self._og_search_title(webpage)
|
title = clip.get('title') or clip.get('channel_title') or self._og_search_title(webpage)
|
||||||
|
|
||||||
formats = [{
|
formats = [{
|
||||||
'url': option['source'],
|
'url': option['source'],
|
||||||
|
@ -174,7 +174,7 @@ class TwitterCardIE(TwitterBaseIE):
|
|||||||
webpage = self._download_webpage(url, video_id)
|
webpage = self._download_webpage(url, video_id)
|
||||||
|
|
||||||
iframe_url = self._html_search_regex(
|
iframe_url = self._html_search_regex(
|
||||||
r'<iframe[^>]+src="((?:https?:)?//(?:www.youtube.com/embed/[^"]+|(?:www\.)?vine\.co/v/\w+/card))"',
|
r'<iframe[^>]+src="((?:https?:)?//(?:www\.youtube\.com/embed/[^"]+|(?:www\.)?vine\.co/v/\w+/card))"',
|
||||||
webpage, 'video iframe', default=None)
|
webpage, 'video iframe', default=None)
|
||||||
if iframe_url:
|
if iframe_url:
|
||||||
return self.url_result(iframe_url)
|
return self.url_result(iframe_url)
|
||||||
|
@ -1,7 +1,6 @@
|
|||||||
# coding: utf-8
|
# coding: utf-8
|
||||||
from __future__ import unicode_literals
|
from __future__ import unicode_literals
|
||||||
|
|
||||||
import json
|
|
||||||
import re
|
import re
|
||||||
|
|
||||||
from .common import InfoExtractor
|
from .common import InfoExtractor
|
||||||
@ -29,6 +28,7 @@ class UDNEmbedIE(InfoExtractor):
|
|||||||
# m3u8 download
|
# m3u8 download
|
||||||
'skip_download': True,
|
'skip_download': True,
|
||||||
},
|
},
|
||||||
|
'expected_warnings': ['Failed to parse JSON Expecting value'],
|
||||||
}, {
|
}, {
|
||||||
'url': 'https://video.udn.com/embed/news/300040',
|
'url': 'https://video.udn.com/embed/news/300040',
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
@ -43,10 +43,21 @@ class UDNEmbedIE(InfoExtractor):
|
|||||||
|
|
||||||
page = self._download_webpage(url, video_id)
|
page = self._download_webpage(url, video_id)
|
||||||
|
|
||||||
options = json.loads(js_to_json(self._html_search_regex(
|
options_str = self._html_search_regex(
|
||||||
r'var\s+options\s*=\s*([^;]+);', page, 'video urls dictionary')))
|
r'var\s+options\s*=\s*([^;]+);', page, 'options')
|
||||||
|
trans_options_str = js_to_json(options_str)
|
||||||
|
options = self._parse_json(trans_options_str, 'options', fatal=False) or {}
|
||||||
|
if options:
|
||||||
video_urls = options['video']
|
video_urls = options['video']
|
||||||
|
title = options['title']
|
||||||
|
poster = options.get('poster')
|
||||||
|
else:
|
||||||
|
video_urls = self._parse_json(self._html_search_regex(
|
||||||
|
r'"video"\s*:\s*({.+?})\s*,', trans_options_str, 'video urls'), 'video urls')
|
||||||
|
title = self._html_search_regex(
|
||||||
|
r"title\s*:\s*'(.+?)'\s*,", options_str, 'title')
|
||||||
|
poster = self._html_search_regex(
|
||||||
|
r"poster\s*:\s*'(.+?)'\s*,", options_str, 'poster', default=None)
|
||||||
|
|
||||||
if video_urls.get('youtube'):
|
if video_urls.get('youtube'):
|
||||||
return self.url_result(video_urls.get('youtube'), 'Youtube')
|
return self.url_result(video_urls.get('youtube'), 'Youtube')
|
||||||
@ -68,7 +79,7 @@ class UDNEmbedIE(InfoExtractor):
|
|||||||
formats.extend(self._extract_f4m_formats(
|
formats.extend(self._extract_f4m_formats(
|
||||||
video_url, video_id, f4m_id='hds'))
|
video_url, video_id, f4m_id='hds'))
|
||||||
else:
|
else:
|
||||||
mobj = re.search(r'_(?P<height>\d+)p_(?P<tbr>\d+).mp4', video_url)
|
mobj = re.search(r'_(?P<height>\d+)p_(?P<tbr>\d+)\.mp4', video_url)
|
||||||
a_format = {
|
a_format = {
|
||||||
'url': video_url,
|
'url': video_url,
|
||||||
# video_type may be 'mp4', which confuses YoutubeDL
|
# video_type may be 'mp4', which confuses YoutubeDL
|
||||||
@ -83,14 +94,9 @@ class UDNEmbedIE(InfoExtractor):
|
|||||||
|
|
||||||
self._sort_formats(formats)
|
self._sort_formats(formats)
|
||||||
|
|
||||||
thumbnails = [{
|
|
||||||
'url': img_url,
|
|
||||||
'id': img_type,
|
|
||||||
} for img_type, img_url in options.get('gallery', [{}])[0].items() if img_url]
|
|
||||||
|
|
||||||
return {
|
return {
|
||||||
'id': video_id,
|
'id': video_id,
|
||||||
'formats': formats,
|
'formats': formats,
|
||||||
'title': options['title'],
|
'title': title,
|
||||||
'thumbnails': thumbnails,
|
'thumbnail': poster,
|
||||||
}
|
}
|
||||||
|
32
youtube_dl/extractor/unity.py
Normal file
32
youtube_dl/extractor/unity.py
Normal file
@ -0,0 +1,32 @@
|
|||||||
|
from __future__ import unicode_literals
|
||||||
|
|
||||||
|
from .common import InfoExtractor
|
||||||
|
from .youtube import YoutubeIE
|
||||||
|
|
||||||
|
|
||||||
|
class UnityIE(InfoExtractor):
|
||||||
|
_VALID_URL = r'https?://(?:www\.)?unity3d\.com/learn/tutorials/(?:[^/]+/)*(?P<id>[^/?#&]+)'
|
||||||
|
_TESTS = [{
|
||||||
|
'url': 'https://unity3d.com/learn/tutorials/topics/animation/animate-anything-mecanim',
|
||||||
|
'info_dict': {
|
||||||
|
'id': 'jWuNtik0C8E',
|
||||||
|
'ext': 'mp4',
|
||||||
|
'title': 'Live Training 22nd September 2014 - Animate Anything',
|
||||||
|
'description': 'md5:e54913114bd45a554c56cdde7669636e',
|
||||||
|
'duration': 2893,
|
||||||
|
'uploader': 'Unity',
|
||||||
|
'uploader_id': 'Unity3D',
|
||||||
|
'upload_date': '20140926',
|
||||||
|
}
|
||||||
|
}, {
|
||||||
|
'url': 'https://unity3d.com/learn/tutorials/projects/2d-ufo-tutorial/following-player-camera?playlist=25844',
|
||||||
|
'only_matching': True,
|
||||||
|
}]
|
||||||
|
|
||||||
|
def _real_extract(self, url):
|
||||||
|
video_id = self._match_id(url)
|
||||||
|
webpage = self._download_webpage(url, video_id)
|
||||||
|
youtube_id = self._search_regex(
|
||||||
|
r'data-video-id="([_0-9a-zA-Z-]+)"',
|
||||||
|
webpage, 'youtube ID')
|
||||||
|
return self.url_result(youtube_id, ie=YoutubeIE.ie_key(), video_id=video_id)
|
@ -1,131 +1,41 @@
|
|||||||
|
# coding: utf-8
|
||||||
from __future__ import unicode_literals
|
from __future__ import unicode_literals
|
||||||
|
|
||||||
from .mtv import MTVIE
|
from .mtv import MTVServicesInfoExtractor
|
||||||
|
|
||||||
import re
|
|
||||||
from ..utils import fix_xml_ampersands
|
|
||||||
|
|
||||||
|
|
||||||
class VH1IE(MTVIE):
|
class VH1IE(MTVServicesInfoExtractor):
|
||||||
IE_NAME = 'vh1.com'
|
IE_NAME = 'vh1.com'
|
||||||
_FEED_URL = 'http://www.vh1.com/player/embed/AS3/fullepisode/rss/'
|
_FEED_URL = 'http://www.vh1.com/feeds/mrss/'
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
'url': 'http://www.vh1.com/video/metal-evolution/full-episodes/progressive-metal/1678612/playlist.jhtml',
|
'url': 'http://www.vh1.com/episodes/0umwpq/hip-hop-squares-kent-jones-vs-nick-young-season-1-ep-120',
|
||||||
'playlist': [
|
|
||||||
{
|
|
||||||
'md5': '7827a7505f59633983165bbd2c119b52',
|
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '731565',
|
'title': 'Kent Jones vs. Nick Young',
|
||||||
'ext': 'mp4',
|
'description': 'Come to Play. Stay to Party. With Mike Epps, TIP, O’Shea Jackson Jr., T-Pain, Tisha Campbell-Martin and more.',
|
||||||
'title': 'Metal Evolution: Ep. 11 Act 1',
|
|
||||||
'description': 'Many rock academics have proclaimed that the truly progressive musicianship of the last 20 years has been found right here in the world of heavy metal, rather than obvious locales such as jazz, fusion or progressive rock. It stands to reason then, that much of this jaw-dropping virtuosity occurs within what\'s known as progressive metal, a genre that takes root with the likes of Rush in the \'70s, Queensryche and Fates Warning in the \'80s, and Dream Theater in the \'90s. Since then, the genre has exploded with creativity, spawning mind-bending, genre-defying acts such as Tool, Mastodon, Coheed And Cambria, Porcupine Tree, Meshuggah, A Perfect Circle and Opeth. Episode 12 looks at the extreme musicianship of these bands, as well as their often extreme literary prowess and conceptual strength, the end result being a rich level of respect and attention such challenging acts have brought upon the world of heavy metal, from a critical community usually dismissive of the form.'
|
|
||||||
}
|
|
||||||
},
|
},
|
||||||
{
|
'playlist_mincount': 4,
|
||||||
'md5': '34fb4b7321c546b54deda2102a61821f',
|
|
||||||
'info_dict': {
|
|
||||||
'id': '731567',
|
|
||||||
'ext': 'mp4',
|
|
||||||
'title': 'Metal Evolution: Ep. 11 Act 2',
|
|
||||||
'description': 'Many rock academics have proclaimed that the truly progressive musicianship of the last 20 years has been found right here in the world of heavy metal, rather than obvious locales such as jazz, fusion or progressive rock. It stands to reason then, that much of this jaw-dropping virtuosity occurs within what\'s known as progressive metal, a genre that takes root with the likes of Rush in the \'70s, Queensryche and Fates Warning in the \'80s, and Dream Theater in the \'90s. Since then, the genre has exploded with creativity, spawning mind-bending, genre-defying acts such as Tool, Mastodon, Coheed And Cambria, Porcupine Tree, Meshuggah, A Perfect Circle and Opeth. Episode 11 looks at the extreme musicianship of these bands, as well as their often extreme literary prowess and conceptual strength, the end result being a rich level of respect and attention such challenging acts have brought upon the world of heavy metal, from a critical community usually dismissive of the form.'
|
|
||||||
}
|
|
||||||
},
|
|
||||||
{
|
|
||||||
'md5': '813f38dba4c1b8647196135ebbf7e048',
|
|
||||||
'info_dict': {
|
|
||||||
'id': '731568',
|
|
||||||
'ext': 'mp4',
|
|
||||||
'title': 'Metal Evolution: Ep. 11 Act 3',
|
|
||||||
'description': 'Many rock academics have proclaimed that the truly progressive musicianship of the last 20 years has been found right here in the world of heavy metal, rather than obvious locales such as jazz, fusion or progressive rock. It stands to reason then, that much of this jaw-dropping virtuosity occurs within what\'s known as progressive metal, a genre that takes root with the likes of Rush in the \'70s, Queensryche and Fates Warning in the \'80s, and Dream Theater in the \'90s. Since then, the genre has exploded with creativity, spawning mind-bending, genre-defying acts such as Tool, Mastodon, Coheed And Cambria, Porcupine Tree, Meshuggah, A Perfect Circle and Opeth. Episode 11 looks at the extreme musicianship of these bands, as well as their often extreme literary prowess and conceptual strength, the end result being a rich level of respect and attention such challenging acts have brought upon the world of heavy metal, from a critical community usually dismissive of the form.'
|
|
||||||
}
|
|
||||||
},
|
|
||||||
{
|
|
||||||
'md5': '51adb72439dfaed11c799115d76e497f',
|
|
||||||
'info_dict': {
|
|
||||||
'id': '731569',
|
|
||||||
'ext': 'mp4',
|
|
||||||
'title': 'Metal Evolution: Ep. 11 Act 4',
|
|
||||||
'description': 'Many rock academics have proclaimed that the truly progressive musicianship of the last 20 years has been found right here in the world of heavy metal, rather than obvious locales such as jazz, fusion or progressive rock. It stands to reason then, that much of this jaw-dropping virtuosity occurs within what\'s known as progressive metal, a genre that takes root with the likes of Rush in the \'70s, Queensryche and Fates Warning in the \'80s, and Dream Theater in the \'90s. Since then, the genre has exploded with creativity, spawning mind-bending, genre-defying acts such as Tool, Mastodon, Coheed And Cambria, Porcupine Tree, Meshuggah, A Perfect Circle and Opeth. Episode 11 looks at the extreme musicianship of these bands, as well as their often extreme literary prowess and conceptual strength, the end result being a rich level of respect and attention such challenging acts have brought upon the world of heavy metal, from a critical community usually dismissive of the form.'
|
|
||||||
}
|
|
||||||
},
|
|
||||||
{
|
|
||||||
'md5': '93d554aaf79320703b73a95288c76a6e',
|
|
||||||
'info_dict': {
|
|
||||||
'id': '731570',
|
|
||||||
'ext': 'mp4',
|
|
||||||
'title': 'Metal Evolution: Ep. 11 Act 5',
|
|
||||||
'description': 'Many rock academics have proclaimed that the truly progressive musicianship of the last 20 years has been found right here in the world of heavy metal, rather than obvious locales such as jazz, fusion or progressive rock. It stands to reason then, that much of this jaw-dropping virtuosity occurs within what\'s known as progressive metal, a genre that takes root with the likes of Rush in the \'70s, Queensryche and Fates Warning in the \'80s, and Dream Theater in the \'90s. Since then, the genre has exploded with creativity, spawning mind-bending, genre-defying acts such as Tool, Mastodon, Coheed And Cambria, Porcupine Tree, Meshuggah, A Perfect Circle and Opeth. Episode 11 looks at the extreme musicianship of these bands, as well as their often extreme literary prowess and conceptual strength, the end result being a rich level of respect and attention such challenging acts have brought upon the world of heavy metal, from a critical community usually dismissive of the form.'
|
|
||||||
}
|
|
||||||
}
|
|
||||||
],
|
|
||||||
'skip': 'Blocked outside the US',
|
|
||||||
}, {
|
}, {
|
||||||
# Clip
|
# Clip
|
||||||
'url': 'http://www.vh1.com/video/misc/706675/metal-evolution-episode-1-pre-metal-show-clip.jhtml#id=1674118',
|
'url': 'http://www.vh1.com/video-clips/t74mif/scared-famous-scared-famous-extended-preview',
|
||||||
'md5': '7d67cf6d9cdc6b4f3d3ac97a55403844',
|
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
'id': '706675',
|
'id': '0a50c2d2-a86b-4141-9565-911c7e2d0b92',
|
||||||
'ext': 'mp4',
|
'ext': 'mp4',
|
||||||
'title': 'Metal Evolution: Episode 1 Pre-Metal Show Clip',
|
'title': 'Scared Famous|October 9, 2017|1|NO-EPISODE#|Scared Famous + Extended Preview',
|
||||||
'description': 'The greatest documentary ever made about Heavy Metal begins as our host Sam Dunn travels the globe to seek out the origins and influences that helped create Heavy Metal. Sam speaks to legends like Kirk Hammett, Alice Cooper, Slash, Bill Ward, Geezer Butler, Tom Morello, Ace Frehley, Lemmy Kilmister, Dave Davies, and many many more. This episode is the prologue for the 11 hour series, and Sam goes back to the very beginning to reveal how Heavy Metal was created.'
|
'description': 'md5:eff5551a274c473a29463de40f7b09da',
|
||||||
|
'upload_date': '20171009',
|
||||||
|
'timestamp': 1507574700,
|
||||||
},
|
},
|
||||||
'skip': 'Blocked outside the US',
|
'params': {
|
||||||
}, {
|
# m3u8 download
|
||||||
# Short link
|
'skip_download': True,
|
||||||
'url': 'http://www.vh1.com/video/play.jhtml?id=1678353',
|
|
||||||
'md5': '853192b87ad978732b67dd8e549b266a',
|
|
||||||
'info_dict': {
|
|
||||||
'id': '730355',
|
|
||||||
'ext': 'mp4',
|
|
||||||
'title': 'Metal Evolution: Episode 11 Progressive Metal Sneak',
|
|
||||||
'description': 'In Metal Evolution\'s finale sneak, Sam sits with Michael Giles of King Crimson and gets feedback from Metallica guitarist Kirk Hammett on why the group was influential.'
|
|
||||||
},
|
},
|
||||||
'skip': 'Blocked outside the US',
|
|
||||||
}, {
|
|
||||||
'url': 'http://www.vh1.com/video/macklemore-ryan-lewis/900535/cant-hold-us-ft-ray-dalton.jhtml',
|
|
||||||
'md5': 'b1bcb5b4380c9d7f544065589432dee7',
|
|
||||||
'info_dict': {
|
|
||||||
'id': '900535',
|
|
||||||
'ext': 'mp4',
|
|
||||||
'title': 'Macklemore & Ryan Lewis - "Can\'t Hold Us ft. Ray Dalton"',
|
|
||||||
'description': 'The Heist'
|
|
||||||
},
|
|
||||||
'skip': 'Blocked outside the US',
|
|
||||||
}]
|
}]
|
||||||
|
|
||||||
_VALID_URL = r'''(?x)
|
_VALID_URL = r'https?://(?:www\.)?vh1\.com/(?:video-clips|episodes)/(?P<id>[^/?#.]+)'
|
||||||
https?://www\.vh1\.com/video/
|
|
||||||
(?:
|
|
||||||
.+?/full-episodes/.+?/(?P<playlist_id>[^/]+)/playlist\.jhtml
|
|
||||||
|
|
|
||||||
(?:
|
|
||||||
play.jhtml\?id=|
|
|
||||||
misc/.+?/.+?\.jhtml\#id=
|
|
||||||
)
|
|
||||||
(?P<video_id>[0-9]+)$
|
|
||||||
|
|
|
||||||
[^/]+/(?P<music_id>[0-9]+)/[^/]+?
|
|
||||||
)
|
|
||||||
'''
|
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
mobj = re.match(self._VALID_URL, url)
|
playlist_id = self._match_id(url)
|
||||||
if mobj.group('music_id'):
|
webpage = self._download_webpage(url, playlist_id)
|
||||||
id_field = 'vid'
|
mgid = self._extract_triforce_mgid(webpage)
|
||||||
video_id = mobj.group('music_id')
|
videos_info = self._get_videos_info(mgid)
|
||||||
else:
|
return videos_info
|
||||||
video_id = mobj.group('playlist_id') or mobj.group('video_id')
|
|
||||||
id_field = 'id'
|
|
||||||
doc_url = '%s?%s=%s' % (self._FEED_URL, id_field, video_id)
|
|
||||||
|
|
||||||
idoc = self._download_xml(
|
|
||||||
doc_url, video_id,
|
|
||||||
'Downloading info', transform_source=fix_xml_ampersands)
|
|
||||||
|
|
||||||
entries = []
|
|
||||||
for item in idoc.findall('.//item'):
|
|
||||||
info = self._get_video_info(item)
|
|
||||||
if info:
|
|
||||||
entries.append(info)
|
|
||||||
|
|
||||||
return self.playlist_result(entries, playlist_id=video_id)
|
|
||||||
|
@ -198,7 +198,7 @@ class ViceShowIE(InfoExtractor):
|
|||||||
|
|
||||||
class ViceArticleIE(InfoExtractor):
|
class ViceArticleIE(InfoExtractor):
|
||||||
IE_NAME = 'vice:article'
|
IE_NAME = 'vice:article'
|
||||||
_VALID_URL = r'https://www.vice.com/[^/]+/article/(?P<id>[^?#]+)'
|
_VALID_URL = r'https://www\.vice\.com/[^/]+/article/(?P<id>[^?#]+)'
|
||||||
|
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
'url': 'https://www.vice.com/en_us/article/on-set-with-the-woman-making-mormon-porn-in-utah',
|
'url': 'https://www.vice.com/en_us/article/on-set-with-the-woman-making-mormon-porn-in-utah',
|
||||||
|
@ -26,7 +26,7 @@ class VideoPremiumIE(InfoExtractor):
|
|||||||
webpage_url = 'http://videopremium.tv/' + video_id
|
webpage_url = 'http://videopremium.tv/' + video_id
|
||||||
webpage = self._download_webpage(webpage_url, video_id)
|
webpage = self._download_webpage(webpage_url, video_id)
|
||||||
|
|
||||||
if re.match(r'^<html><head><script[^>]*>window.location\s*=', webpage):
|
if re.match(r'^<html><head><script[^>]*>window\.location\s*=', webpage):
|
||||||
# Download again, we need a cookie
|
# Download again, we need a cookie
|
||||||
webpage = self._download_webpage(
|
webpage = self._download_webpage(
|
||||||
webpage_url, video_id,
|
webpage_url, video_id,
|
||||||
|
@ -2,11 +2,44 @@
|
|||||||
from __future__ import unicode_literals
|
from __future__ import unicode_literals
|
||||||
|
|
||||||
from .common import InfoExtractor
|
from .common import InfoExtractor
|
||||||
|
from .once import OnceIE
|
||||||
from ..compat import compat_urllib_parse_unquote
|
from ..compat import compat_urllib_parse_unquote
|
||||||
|
from ..utils import ExtractorError
|
||||||
|
|
||||||
|
|
||||||
|
class VoxMediaVolumeIE(OnceIE):
|
||||||
|
_VALID_URL = r'https?://volume\.vox-cdn\.com/embed/(?P<id>[0-9a-f]{9})'
|
||||||
|
|
||||||
|
def _real_extract(self, url):
|
||||||
|
video_id = self._match_id(url)
|
||||||
|
webpage = self._download_webpage(url, video_id)
|
||||||
|
video_data = self._parse_json(self._search_regex(
|
||||||
|
r'Volume\.createVideo\(({.+})\s*,\s*{.*}\s*,\s*\[.*\]\s*,\s*{.*}\);', webpage, 'video data'), video_id)
|
||||||
|
for provider_video_type in ('ooyala', 'youtube', 'brightcove'):
|
||||||
|
provider_video_id = video_data.get('%s_id' % provider_video_type)
|
||||||
|
if not provider_video_id:
|
||||||
|
continue
|
||||||
|
info = {
|
||||||
|
'id': video_id,
|
||||||
|
'title': video_data.get('title_short'),
|
||||||
|
'description': video_data.get('description_long') or video_data.get('description_short'),
|
||||||
|
'thumbnail': video_data.get('brightcove_thumbnail')
|
||||||
|
}
|
||||||
|
if provider_video_type == 'brightcove':
|
||||||
|
info['formats'] = self._extract_once_formats(provider_video_id)
|
||||||
|
self._sort_formats(info['formats'])
|
||||||
|
else:
|
||||||
|
info.update({
|
||||||
|
'_type': 'url_transparent',
|
||||||
|
'url': provider_video_id if provider_video_type == 'youtube' else '%s:%s' % (provider_video_type, provider_video_id),
|
||||||
|
'ie_key': provider_video_type.capitalize(),
|
||||||
|
})
|
||||||
|
return info
|
||||||
|
raise ExtractorError('Unable to find provider video id')
|
||||||
|
|
||||||
|
|
||||||
class VoxMediaIE(InfoExtractor):
|
class VoxMediaIE(InfoExtractor):
|
||||||
_VALID_URL = r'https?://(?:www\.)?(?:theverge|vox|sbnation|eater|polygon|curbed|racked)\.com/(?:[^/]+/)*(?P<id>[^/?]+)'
|
_VALID_URL = r'https?://(?:www\.)?(?:(?:theverge|vox|sbnation|eater|polygon|curbed|racked)\.com|recode\.net)/(?:[^/]+/)*(?P<id>[^/?]+)'
|
||||||
_TESTS = [{
|
_TESTS = [{
|
||||||
'url': 'http://www.theverge.com/2014/6/27/5849272/material-world-how-google-discovered-what-software-is-made-of',
|
'url': 'http://www.theverge.com/2014/6/27/5849272/material-world-how-google-discovered-what-software-is-made-of',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
@ -31,6 +64,7 @@ class VoxMediaIE(InfoExtractor):
|
|||||||
'description': 'md5:87a51fe95ff8cea8b5bdb9ac7ae6a6af',
|
'description': 'md5:87a51fe95ff8cea8b5bdb9ac7ae6a6af',
|
||||||
},
|
},
|
||||||
'add_ie': ['Ooyala'],
|
'add_ie': ['Ooyala'],
|
||||||
|
'skip': 'Video Not Found',
|
||||||
}, {
|
}, {
|
||||||
# volume embed
|
# volume embed
|
||||||
'url': 'http://www.vox.com/2016/3/31/11336640/mississippi-lgbt-religious-freedom-bill',
|
'url': 'http://www.vox.com/2016/3/31/11336640/mississippi-lgbt-religious-freedom-bill',
|
||||||
@ -84,6 +118,17 @@ class VoxMediaIE(InfoExtractor):
|
|||||||
'description': 'md5:e02d56b026d51aa32c010676765a690d',
|
'description': 'md5:e02d56b026d51aa32c010676765a690d',
|
||||||
},
|
},
|
||||||
}],
|
}],
|
||||||
|
}, {
|
||||||
|
# volume embed, Brightcove Once
|
||||||
|
'url': 'https://www.recode.net/2014/6/17/11628066/post-post-pc-ceo-the-full-code-conference-video-of-microsofts-satya',
|
||||||
|
'md5': '01571a896281f77dc06e084138987ea2',
|
||||||
|
'info_dict': {
|
||||||
|
'id': '1231c973d',
|
||||||
|
'ext': 'mp4',
|
||||||
|
'title': 'Post-Post-PC CEO: The Full Code Conference Video of Microsoft\'s Satya Nadella',
|
||||||
|
'description': 'The longtime veteran was chosen earlier this year as the software giant\'s third leader in its history.',
|
||||||
|
},
|
||||||
|
'add_ie': ['VoxMediaVolume'],
|
||||||
}]
|
}]
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
@ -91,9 +136,14 @@ class VoxMediaIE(InfoExtractor):
|
|||||||
webpage = compat_urllib_parse_unquote(self._download_webpage(url, display_id))
|
webpage = compat_urllib_parse_unquote(self._download_webpage(url, display_id))
|
||||||
|
|
||||||
def create_entry(provider_video_id, provider_video_type, title=None, description=None):
|
def create_entry(provider_video_id, provider_video_type, title=None, description=None):
|
||||||
|
video_url = {
|
||||||
|
'youtube': '%s',
|
||||||
|
'ooyala': 'ooyala:%s',
|
||||||
|
'volume': 'http://volume.vox-cdn.com/embed/%s',
|
||||||
|
}[provider_video_type] % provider_video_id
|
||||||
return {
|
return {
|
||||||
'_type': 'url_transparent',
|
'_type': 'url_transparent',
|
||||||
'url': provider_video_id if provider_video_type == 'youtube' else '%s:%s' % (provider_video_type, provider_video_id),
|
'url': video_url,
|
||||||
'title': title or self._og_search_title(webpage),
|
'title': title or self._og_search_title(webpage),
|
||||||
'description': description or self._og_search_description(webpage),
|
'description': description or self._og_search_description(webpage),
|
||||||
}
|
}
|
||||||
@ -124,17 +174,7 @@ class VoxMediaIE(InfoExtractor):
|
|||||||
volume_uuid = self._search_regex(
|
volume_uuid = self._search_regex(
|
||||||
r'data-volume-uuid="([^"]+)"', webpage, 'volume uuid', default=None)
|
r'data-volume-uuid="([^"]+)"', webpage, 'volume uuid', default=None)
|
||||||
if volume_uuid:
|
if volume_uuid:
|
||||||
volume_webpage = self._download_webpage(
|
entries.append(create_entry(volume_uuid, 'volume'))
|
||||||
'http://volume.vox-cdn.com/embed/%s' % volume_uuid, volume_uuid)
|
|
||||||
video_data = self._parse_json(self._search_regex(
|
|
||||||
r'Volume\.createVideo\(({.+})\s*,\s*{.*}\s*,\s*\[.*\]\s*,\s*{.*}\);', volume_webpage, 'video data'), volume_uuid)
|
|
||||||
for provider_video_type in ('ooyala', 'youtube'):
|
|
||||||
provider_video_id = video_data.get('%s_id' % provider_video_type)
|
|
||||||
if provider_video_id:
|
|
||||||
description = video_data.get('description_long') or video_data.get('description_short')
|
|
||||||
entries.append(create_entry(
|
|
||||||
provider_video_id, provider_video_type, video_data.get('title_short'), description))
|
|
||||||
break
|
|
||||||
|
|
||||||
if len(entries) == 1:
|
if len(entries) == 1:
|
||||||
return entries[0]
|
return entries[0]
|
||||||
|
@ -133,7 +133,7 @@ class VVVVIDIE(InfoExtractor):
|
|||||||
'season_id': season_id,
|
'season_id': season_id,
|
||||||
'season_number': video_data.get('season_number'),
|
'season_number': video_data.get('season_number'),
|
||||||
'episode_id': str_or_none(video_data.get('id')),
|
'episode_id': str_or_none(video_data.get('id')),
|
||||||
'epidode_number': int_or_none(video_data.get('number')),
|
'episode_number': int_or_none(video_data.get('number')),
|
||||||
'episode_title': video_data['title'],
|
'episode_title': video_data['title'],
|
||||||
'view_count': int_or_none(video_data.get('views')),
|
'view_count': int_or_none(video_data.get('views')),
|
||||||
'like_count': int_or_none(video_data.get('video_likes')),
|
'like_count': int_or_none(video_data.get('video_likes')),
|
||||||
|
@ -22,8 +22,13 @@ class WDRBaseIE(InfoExtractor):
|
|||||||
# for wdrmaus, in a tag with the class "videoButton" (previously a link
|
# for wdrmaus, in a tag with the class "videoButton" (previously a link
|
||||||
# to the page in a multiline "videoLink"-tag)
|
# to the page in a multiline "videoLink"-tag)
|
||||||
json_metadata = self._html_search_regex(
|
json_metadata = self._html_search_regex(
|
||||||
r'class=(?:"(?:mediaLink|wdrrPlayerPlayBtn|videoButton)\b[^"]*"[^>]+|"videoLink\b[^"]*"[\s]*>\n[^\n]*)data-extension="([^"]+)"',
|
r'''(?sx)class=
|
||||||
webpage, 'media link', default=None, flags=re.MULTILINE)
|
(?:
|
||||||
|
(["\'])(?:mediaLink|wdrrPlayerPlayBtn|videoButton)\b.*?\1[^>]+|
|
||||||
|
(["\'])videoLink\b.*?\2[\s]*>\n[^\n]*
|
||||||
|
)data-extension=(["\'])(?P<data>(?:(?!\3).)+)\3
|
||||||
|
''',
|
||||||
|
webpage, 'media link', default=None, group='data')
|
||||||
|
|
||||||
if not json_metadata:
|
if not json_metadata:
|
||||||
return
|
return
|
||||||
|
@ -18,7 +18,7 @@ class XTubeIE(InfoExtractor):
|
|||||||
_VALID_URL = r'''(?x)
|
_VALID_URL = r'''(?x)
|
||||||
(?:
|
(?:
|
||||||
xtube:|
|
xtube:|
|
||||||
https?://(?:www\.)?xtube\.com/(?:watch\.php\?.*\bv=|video-watch/(?P<display_id>[^/]+)-)
|
https?://(?:www\.)?xtube\.com/(?:watch\.php\?.*\bv=|video-watch/(?:embedded/)?(?P<display_id>[^/]+)-)
|
||||||
)
|
)
|
||||||
(?P<id>[^/?&#]+)
|
(?P<id>[^/?&#]+)
|
||||||
'''
|
'''
|
||||||
@ -64,6 +64,9 @@ class XTubeIE(InfoExtractor):
|
|||||||
}, {
|
}, {
|
||||||
'url': 'xtube:kVTUy_G222_',
|
'url': 'xtube:kVTUy_G222_',
|
||||||
'only_matching': True,
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'https://www.xtube.com/video-watch/embedded/milf-tara-and-teen-shared-and-cum-covered-extreme-bukkake-32203482?embedsize=big',
|
||||||
|
'only_matching': True,
|
||||||
}]
|
}]
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
|
@ -14,8 +14,16 @@ from ..utils import (
|
|||||||
|
|
||||||
|
|
||||||
class XVideosIE(InfoExtractor):
|
class XVideosIE(InfoExtractor):
|
||||||
_VALID_URL = r'https?://(?:www\.)?xvideos\.com/video(?P<id>[0-9]+)(?:.*)'
|
_VALID_URL = r'''(?x)
|
||||||
_TEST = {
|
https?://
|
||||||
|
(?:
|
||||||
|
(?:www\.)?xvideos\.com/video|
|
||||||
|
flashservice\.xvideos\.com/embedframe/|
|
||||||
|
static-hw\.xvideos\.com/swf/xv-player\.swf\?.*?\bid_video=
|
||||||
|
)
|
||||||
|
(?P<id>[0-9]+)
|
||||||
|
'''
|
||||||
|
_TESTS = [{
|
||||||
'url': 'http://www.xvideos.com/video4588838/biker_takes_his_girl',
|
'url': 'http://www.xvideos.com/video4588838/biker_takes_his_girl',
|
||||||
'md5': '14cea69fcb84db54293b1e971466c2e1',
|
'md5': '14cea69fcb84db54293b1e971466c2e1',
|
||||||
'info_dict': {
|
'info_dict': {
|
||||||
@ -25,21 +33,33 @@ class XVideosIE(InfoExtractor):
|
|||||||
'duration': 108,
|
'duration': 108,
|
||||||
'age_limit': 18,
|
'age_limit': 18,
|
||||||
}
|
}
|
||||||
}
|
}, {
|
||||||
|
'url': 'https://flashservice.xvideos.com/embedframe/4588838',
|
||||||
|
'only_matching': True,
|
||||||
|
}, {
|
||||||
|
'url': 'http://static-hw.xvideos.com/swf/xv-player.swf?id_video=4588838',
|
||||||
|
'only_matching': True,
|
||||||
|
}]
|
||||||
|
|
||||||
def _real_extract(self, url):
|
def _real_extract(self, url):
|
||||||
video_id = self._match_id(url)
|
video_id = self._match_id(url)
|
||||||
webpage = self._download_webpage(url, video_id)
|
|
||||||
|
webpage = self._download_webpage(
|
||||||
|
'http://www.xvideos.com/video%s/' % video_id, video_id)
|
||||||
|
|
||||||
mobj = re.search(r'<h1 class="inlineError">(.+?)</h1>', webpage)
|
mobj = re.search(r'<h1 class="inlineError">(.+?)</h1>', webpage)
|
||||||
if mobj:
|
if mobj:
|
||||||
raise ExtractorError('%s said: %s' % (self.IE_NAME, clean_html(mobj.group(1))), expected=True)
|
raise ExtractorError('%s said: %s' % (self.IE_NAME, clean_html(mobj.group(1))), expected=True)
|
||||||
|
|
||||||
video_title = self._html_search_regex(
|
title = self._html_search_regex(
|
||||||
r'<title>(.*?)\s+-\s+XVID', webpage, 'title')
|
(r'<title>(?P<title>.+?)\s+-\s+XVID',
|
||||||
video_thumbnail = self._search_regex(
|
r'setVideoTitle\s*\(\s*(["\'])(?P<title>(?:(?!\1).)+)\1'),
|
||||||
|
webpage, 'title', default=None,
|
||||||
|
group='title') or self._og_search_title(webpage)
|
||||||
|
|
||||||
|
thumbnail = self._search_regex(
|
||||||
r'url_bigthumb=(.+?)&', webpage, 'thumbnail', fatal=False)
|
r'url_bigthumb=(.+?)&', webpage, 'thumbnail', fatal=False)
|
||||||
video_duration = int_or_none(self._og_search_property(
|
duration = int_or_none(self._og_search_property(
|
||||||
'duration', webpage, default=None)) or parse_duration(
|
'duration', webpage, default=None)) or parse_duration(
|
||||||
self._search_regex(
|
self._search_regex(
|
||||||
r'<span[^>]+class=["\']duration["\'][^>]*>.*?(\d[^<]+)',
|
r'<span[^>]+class=["\']duration["\'][^>]*>.*?(\d[^<]+)',
|
||||||
@ -74,8 +94,8 @@ class XVideosIE(InfoExtractor):
|
|||||||
return {
|
return {
|
||||||
'id': video_id,
|
'id': video_id,
|
||||||
'formats': formats,
|
'formats': formats,
|
||||||
'title': video_title,
|
'title': title,
|
||||||
'duration': video_duration,
|
'duration': duration,
|
||||||
'thumbnail': video_thumbnail,
|
'thumbnail': thumbnail,
|
||||||
'age_limit': 18,
|
'age_limit': 18,
|
||||||
}
|
}
|
||||||
|
@ -332,6 +332,7 @@ class YoutubeIE(YoutubeBaseInfoExtractor):
|
|||||||
(?:(?:(?:(?:\w+\.)?[yY][oO][uU][tT][uU][bB][eE](?:-nocookie)?\.com/|
|
(?:(?:(?:(?:\w+\.)?[yY][oO][uU][tT][uU][bB][eE](?:-nocookie)?\.com/|
|
||||||
(?:www\.)?deturl\.com/www\.youtube\.com/|
|
(?:www\.)?deturl\.com/www\.youtube\.com/|
|
||||||
(?:www\.)?pwnyoutube\.com/|
|
(?:www\.)?pwnyoutube\.com/|
|
||||||
|
(?:www\.)?hooktube\.com/|
|
||||||
(?:www\.)?yourepeat\.com/|
|
(?:www\.)?yourepeat\.com/|
|
||||||
tube\.majestyc\.net/|
|
tube\.majestyc\.net/|
|
||||||
youtube\.googleapis\.com/) # the various hostnames, with wildcard subdomains
|
youtube\.googleapis\.com/) # the various hostnames, with wildcard subdomains
|
||||||
@ -1621,6 +1622,17 @@ class YoutubeIE(YoutubeBaseInfoExtractor):
|
|||||||
# description
|
# description
|
||||||
description_original = video_description = get_element_by_id("eow-description", video_webpage)
|
description_original = video_description = get_element_by_id("eow-description", video_webpage)
|
||||||
if video_description:
|
if video_description:
|
||||||
|
|
||||||
|
def replace_url(m):
|
||||||
|
redir_url = compat_urlparse.urljoin(url, m.group(1))
|
||||||
|
parsed_redir_url = compat_urllib_parse_urlparse(redir_url)
|
||||||
|
if re.search(r'^(?:www\.)?(?:youtube(?:-nocookie)?\.com|youtu\.be)$', parsed_redir_url.netloc) and parsed_redir_url.path == '/redirect':
|
||||||
|
qs = compat_parse_qs(parsed_redir_url.query)
|
||||||
|
q = qs.get('q')
|
||||||
|
if q and q[0]:
|
||||||
|
return q[0]
|
||||||
|
return redir_url
|
||||||
|
|
||||||
description_original = video_description = re.sub(r'''(?x)
|
description_original = video_description = re.sub(r'''(?x)
|
||||||
<a\s+
|
<a\s+
|
||||||
(?:[a-zA-Z-]+="[^"]*"\s+)*?
|
(?:[a-zA-Z-]+="[^"]*"\s+)*?
|
||||||
@ -1629,7 +1641,7 @@ class YoutubeIE(YoutubeBaseInfoExtractor):
|
|||||||
class="[^"]*"[^>]*>
|
class="[^"]*"[^>]*>
|
||||||
[^<]+\.{3}\s*
|
[^<]+\.{3}\s*
|
||||||
</a>
|
</a>
|
||||||
''', r'\1', video_description)
|
''', replace_url, video_description)
|
||||||
video_description = clean_html(video_description)
|
video_description = clean_html(video_description)
|
||||||
else:
|
else:
|
||||||
fd_mobj = re.search(r'<meta name="description" content="([^"]+)"', video_webpage)
|
fd_mobj = re.search(r'<meta name="description" content="([^"]+)"', video_webpage)
|
||||||
@ -1682,7 +1694,7 @@ class YoutubeIE(YoutubeBaseInfoExtractor):
|
|||||||
video_uploader_id = None
|
video_uploader_id = None
|
||||||
video_uploader_url = None
|
video_uploader_url = None
|
||||||
mobj = re.search(
|
mobj = re.search(
|
||||||
r'<link itemprop="url" href="(?P<uploader_url>https?://www.youtube.com/(?:user|channel)/(?P<uploader_id>[^"]+))">',
|
r'<link itemprop="url" href="(?P<uploader_url>https?://www\.youtube\.com/(?:user|channel)/(?P<uploader_id>[^"]+))">',
|
||||||
video_webpage)
|
video_webpage)
|
||||||
if mobj is not None:
|
if mobj is not None:
|
||||||
video_uploader_id = mobj.group('uploader_id')
|
video_uploader_id = mobj.group('uploader_id')
|
||||||
@ -2039,39 +2051,6 @@ class YoutubeIE(YoutubeBaseInfoExtractor):
|
|||||||
}
|
}
|
||||||
|
|
||||||
|
|
||||||
class YoutubeSharedVideoIE(InfoExtractor):
|
|
||||||
_VALID_URL = r'(?:https?:)?//(?:www\.)?youtube\.com/shared\?.*\bci=(?P<id>[0-9A-Za-z_-]{11})'
|
|
||||||
IE_NAME = 'youtube:shared'
|
|
||||||
|
|
||||||
_TEST = {
|
|
||||||
'url': 'https://www.youtube.com/shared?ci=1nEzmT-M4fU',
|
|
||||||
'info_dict': {
|
|
||||||
'id': 'uPDB5I9wfp8',
|
|
||||||
'ext': 'webm',
|
|
||||||
'title': 'Pocoyo: 90 minutos de episódios completos Português para crianças - PARTE 3',
|
|
||||||
'description': 'md5:d9e4d9346a2dfff4c7dc4c8cec0f546d',
|
|
||||||
'upload_date': '20160219',
|
|
||||||
'uploader': 'Pocoyo - Português (BR)',
|
|
||||||
'uploader_id': 'PocoyoBrazil',
|
|
||||||
},
|
|
||||||
'add_ie': ['Youtube'],
|
|
||||||
'params': {
|
|
||||||
# There are already too many Youtube downloads
|
|
||||||
'skip_download': True,
|
|
||||||
},
|
|
||||||
}
|
|
||||||
|
|
||||||
def _real_extract(self, url):
|
|
||||||
video_id = self._match_id(url)
|
|
||||||
|
|
||||||
webpage = self._download_webpage(url, video_id)
|
|
||||||
|
|
||||||
real_video_id = self._html_search_meta(
|
|
||||||
'videoId', webpage, 'YouTube video id', fatal=True)
|
|
||||||
|
|
||||||
return self.url_result(real_video_id, YoutubeIE.ie_key())
|
|
||||||
|
|
||||||
|
|
||||||
class YoutubePlaylistIE(YoutubePlaylistBaseInfoExtractor):
|
class YoutubePlaylistIE(YoutubePlaylistBaseInfoExtractor):
|
||||||
IE_DESC = 'YouTube.com playlists'
|
IE_DESC = 'YouTube.com playlists'
|
||||||
_VALID_URL = r"""(?x)(?:
|
_VALID_URL = r"""(?x)(?:
|
||||||
|
@ -1933,7 +1933,7 @@ class PagedList(object):
|
|||||||
|
|
||||||
|
|
||||||
class OnDemandPagedList(PagedList):
|
class OnDemandPagedList(PagedList):
|
||||||
def __init__(self, pagefunc, pagesize, use_cache=False):
|
def __init__(self, pagefunc, pagesize, use_cache=True):
|
||||||
self._pagefunc = pagefunc
|
self._pagefunc = pagefunc
|
||||||
self._pagesize = pagesize
|
self._pagesize = pagesize
|
||||||
self._use_cache = use_cache
|
self._use_cache = use_cache
|
||||||
|
@ -1,3 +1,3 @@
|
|||||||
from __future__ import unicode_literals
|
from __future__ import unicode_literals
|
||||||
|
|
||||||
__version__ = '2017.10.01'
|
__version__ = '2017.10.20'
|
||||||
|
Loading…
x
Reference in New Issue
Block a user