365 Commits

Author SHA1 Message Date
Unknown
6c894ea191 [youtube/ytsearch] fix yt search feed + version update. 2020-10-24 06:57:14 +02:00
Unknown
dd2d55f10d COMPLAINFREE 2020-10-24 05:09:22 +02:00
Unknown
3a6a581d94 Merge remote-tracking branch 'origin/master' 2020-10-23 16:53:59 +02:00
Unknown
07bafb4a90 [reddit] best format hotfix based on resolution. 2020-10-23 16:53:52 +02:00
Tom-Oliver Heidel
7eff09d332 Merge pull request #196 from blackjack4494/twitter_shortener
Twitter shortener
2020-10-18 07:43:42 +02:00
Unknown
957c523eea [youtube] cookie update reminder 2020-10-18 03:04:10 +02:00
Unknown
a537ab1a09 [twitter/t.co] update supportedsites, failover replace, tco:id feature 2020-10-18 02:14:13 +02:00
Unknown
9e20a9c447 [twitter/t.co] implemented. 2020-10-17 10:24:57 +02:00
Unknown
51707d9a7a [MTV/Nick] universal mgid extractor + fix nick.de feed 2020-10-17 08:26:39 +02:00
Unknown
f33b7b5eb4 [Twitter/t.co] showcase expanded how to use generic 2020-10-13 02:03:48 +02:00
Unknown
86b868c6a5 [Twitter/t.co] simple extractor added. modification needed. 2020-10-13 01:58:59 +02:00
Tom-Oliver Heidel
d8f97cc1d3 Merge pull request #188 from blackjack4494/SouthparkDE_MTV
[SouthparkDE/MTV] another mgid extraction (mtv_base) feed url updated
2020-10-13 01:03:29 +02:00
Tom-Oliver Heidel
573c752256 Merge branch 'la7-fix' of https://github.com/iamleot/youtube-dl into iamleot-la7-fix 2020-10-13 00:58:04 +02:00
Unknown
bc887cdd01 [SouthparkDE] regex and tests 2020-10-13 00:47:17 +02:00
Unknown
320724f964 [SouthparkDE/MTV] another mgid extraction (mtv_base) feed url updated 2020-10-12 23:46:02 +02:00
Tom-Oliver Heidel
60ecb525b2 Merge branch 'fixYTSearch' of https://github.com/xarantolus/youtube-dl into xarantolus-fixYTSearch 2020-10-09 08:19:38 +02:00
Tom-Oliver Heidel
cfd7f14bb3 Merge pull request #176 from blackjack4494/mtv_updated_extractor_logic
[Mtv] updated extractor logic & more
2020-10-09 08:01:31 +02:00
Tom-Oliver Heidel
b492464bf1 Merge pull request #171 from blackjack4494/yt_only_age_gate
[youtube] fix yt-only playback when age restricted/gated - requires cookies
2020-10-09 07:57:39 +02:00
Unknown
cf7cb94287 [mtvn] update mtv network related extractors 2020-10-09 07:50:22 +02:00
Unknown
b6e0c7d2e3 [mtv] fix mtv.com and more(?) 2020-10-09 07:06:49 +02:00
Unknown
962cc3ef87 merge bandcamp 2020-10-07 05:42:38 +02:00
Unknown
b777004649 Merge branch 'ytdl-org-master' 2020-10-07 05:34:22 +02:00
Tom-Oliver Heidel
044ecf795d Merge branch 'feature_subscriber_count' of https://github.com/RedpointsBots/youtube-dl into RedpointsBots-feature_subscriber_count 2020-10-07 05:22:31 +02:00
Tom-Oliver Heidel
a87a873d24 Merge branch 'bugfix_youtube_like_extraction' of https://github.com/RedpointsBots/youtube-dl into RedpointsBots-bugfix_youtube_like_extraction 2020-10-07 05:13:25 +02:00
Unknown
c73baf23e0 fix to support python 2.6 2020-10-07 04:54:38 +02:00
Unknown
4bb9c8802e flake8 2020-10-07 04:31:23 +02:00
Unknown
9d9314cb66 [youtube] only playable on yt and age gated 2020-10-07 04:19:08 +02:00
Unknown
3d6a47d35f [skip travis] version 2020-09-30 07:11:49 +02:00
Unknown
bdc3fd2f35 [core] add option to trim file name length with integer
https://github.com/blackjack4494/youtube-dlc/issues/85
2020-09-30 05:50:09 +02:00
Unknown
6923b5381f [hotstar] several api changes and payloads/queries 2020-09-30 03:51:40 +02:00
Unknown
3a379e5e83 [Bandcamp] update - fix regexp for JSON matching 2020-09-29 05:54:36 +02:00
Unknown
0c9df79e17 [core] no sleep affected subtitles only with enforced flag 2020-09-29 05:11:32 +02:00
Unknown
88bdacf33c Merge remote-tracking branch 'origin/master' 2020-09-29 01:42:36 +02:00
Unknown
8219ef6427 [tiktok] add referer - required to download from cdn 2020-09-29 01:42:25 +02:00
stephen
61e4c6ed45 Added regex for ABC.com site. 2020-09-27 05:33:37 -05:00
Unknown
b33c48f269 [skip travis] version bump 2020-09-23 05:11:32 +02:00
Tom-Oliver Heidel
04b61c6572 Merge branch 'naver' of https://github.com/SeonjaeHyeon/youtube-dl into SeonjaeHyeon-naver 2020-09-23 04:01:51 +02:00
Unknown
915f2a92ac update workflow, semi fix integrated updater 2020-09-23 03:16:06 +02:00
Unknown
1b3f7c9a7e merge youtube-dl master 22.09.2020 2020-09-22 16:09:54 +02:00
Jody Bruchon
a45e861918 Switch from binary search tree to Python sets
Signed-off-by: Jody Bruchon <jody@jodybruchon.com>
2020-09-18 21:18:23 -04:00
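For illustration only, here is a minimal sketch of what keeping the archive in a Python set might look like. It is not the code from this commit: the function name `load_archive` and the sample entries are hypothetical. A set gives average constant-time membership checks regardless of the order in which entries were loaded, so a sorted archive file is no longer a worst case.

```python
def load_archive(lines):
    """Build a set of archive entries from an iterable of text lines.

    Hypothetical helper: blank lines are skipped and surrounding
    whitespace (including the trailing newline) is stripped.
    """
    return {line.strip() for line in lines if line.strip()}


# Sample entries are made up for this sketch.
archive = load_archive(["youtube abc123\n", "youtube def456\n", "\n"])

# Membership test: hash lookup, independent of load order.
already_downloaded = "youtube abc123" in archive

# A newly downloaded ID is added to the in-memory set as well.
archive.add("youtube zzz999")
```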
Jody Bruchon
fd87f42378 Randomize the ArchiveTree the proper Python way
Signed-off-by: Jody Bruchon <jody@jodybruchon.com>
2020-09-18 14:22:42 -04:00
Jody Bruchon
2459b6e1cf Style revisions 2020-09-18 09:35:21 -04:00
Jody Bruchon
4f0150dcec Merge remote-tracking branch 'upstream/master' 2020-09-18 08:49:11 -04:00
Unknown
35d3b674c7 [hotstar] regex the second. 2020-09-18 14:15:34 +02:00
Jody Bruchon
a4d834fb3e Fix wrong variable in position swap corrupting archive list
It's always a simple error in the end, you know?

Signed-off-by: Jody Bruchon <jody@jodybruchon.com>
2020-09-18 00:11:36 -04:00
Jody Bruchon
fda63a4e87 Randomize archive order before populating search tree
This doesn't result in an elegant, perfectly balanced search tree,
but it's absolutely good enough. This commit completely mitigates
the worst-case scenario where the archive file is sorted.

Signed-off-by: Jody Bruchon <jody@jodybruchon.com>
2020-09-17 21:45:40 -04:00
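The effect of shuffling can be demonstrated with a small sketch. This is not the code from the commit: `tree_depth`, the node layout, and the generated IDs are assumptions made for the example. Inserting sorted keys into an unbalanced binary search tree yields a depth equal to the number of keys, while inserting the same keys in random order yields a much shallower tree.

```python
import random


def tree_depth(keys):
    """Insert keys into an unbalanced BST (iteratively) and return its depth."""
    root = None  # each node is a list: [key, left_child, right_child]
    depth = 0
    for key in keys:
        if root is None:
            root = [key, None, None]
            depth = 1
            continue
        node, level = root, 1
        while True:
            if key == node[0]:
                break  # duplicate key, nothing to insert
            idx = 1 if key < node[0] else 2
            level += 1
            if node[idx] is None:
                node[idx] = [key, None, None]
                depth = max(depth, level)
                break
            node = node[idx]
    return depth


# Made-up IDs, generated in sorted order.
ids = ["id%05d" % i for i in range(1000)]
sorted_depth = tree_depth(ids)

shuffled = list(ids)
random.Random(0).shuffle(shuffled)  # fixed seed only to make the sketch repeatable
shuffled_depth = tree_depth(shuffled)
```

With sorted input every key becomes the right child of the previous one, so `sorted_depth` equals the number of keys. The shuffled tree is not perfectly balanced, but its depth is a small fraction of that.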
Jody Bruchon
1d74d8d9f6 Try to mitigate the problem of loading a fully sorted archive
Sorted archives turn the binary tree into a linked list and make
things horribly slow. This is an incomplete mitigation for this
issue.
2020-09-17 17:28:22 -04:00
Jody Bruchon
1de7ea76f8 Remove recursion in at_insert() 2020-09-17 15:08:33 -04:00
Jody Bruchon
a5029645ae Remove debugging print statements 2020-09-17 14:46:11 -04:00
Jody Bruchon
ecdec1913f Keep download archive in memory for better performance
The old behavior was to open and scan the entire archive file for
every single video download. This resulted in horrible performance
for archives of any remotely large size, especially since all new
video IDs are appended to the end of the archive. For anyone who
uses the archive feature to maintain archives of entire video
playlists or channels, this meant that all such lists with newer
downloads would have to scan close to the end of the archive file
before the potential download was rejected. For archives with tens
of thousands of lines, this easily resulted in millions of line
reads and checks over the course of scanning a single channel or
playlist that had been seen previously.

The new behavior in this commit is to preload the archive file
into a binary search tree and scan the tree instead of constantly
scanning the file on disk for every file. When a new download is
appended to the archive file, it is also added to this tree. The
performance is massively better using this strategy over the more
"naive" line-by-line archive file parsing strategy.

The only negative consequence of this change is that the archive
in memory will not be synchronized with the archive file on disk.
Running multiple instances of the program at the same time that
all use the same archive file may result in duplicate archive
entries or duplicated downloads. This is unlikely to be a serious
issue for the vast majority of users. If the instances are not
likely to try to download identical video IDs then this should
not be a problem anyway; for example, having two instances pull
two completely different YouTube channels at once should be fine.

Signed-off-by: Jody Bruchon <jody@jodybruchon.com>
2020-09-17 14:22:07 -04:00
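The strategy described in this commit message can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the commit's actual code: the names `Node`, `InMemoryArchive`, `record`, and `demo` are hypothetical, the archive is assumed to hold one entry per line, and the sample entries are made up. The file is read once at startup; lookups then walk the tree in memory, and each new download is written to disk and mirrored in the tree.

```python
import os
import tempfile


class Node:
    """One node of an unbalanced binary search tree of archive lines."""
    __slots__ = ("key", "left", "right")

    def __init__(self, key):
        self.key = key
        self.left = None
        self.right = None


class InMemoryArchive:
    """Hypothetical wrapper: load the archive file once, then query memory."""

    def __init__(self, path):
        self.path = path
        self.root = None
        if os.path.exists(path):
            with open(path, encoding="utf-8") as f:
                for line in f:
                    entry = line.strip()
                    if entry:
                        self._insert(entry)

    def _insert(self, key):
        # Iterative insert, so a deep tree cannot hit the recursion limit.
        if self.root is None:
            self.root = Node(key)
            return
        node = self.root
        while True:
            if key == node.key:
                return  # already present
            side = "left" if key < node.key else "right"
            child = getattr(node, side)
            if child is None:
                setattr(node, side, Node(key))
                return
            node = child

    def __contains__(self, key):
        node = self.root
        while node is not None:
            if key == node.key:
                return True
            node = node.left if key < node.key else node.right
        return False

    def record(self, key):
        """Append the entry to the file on disk and mirror it in memory."""
        with open(self.path, "a", encoding="utf-8") as f:
            f.write(key + "\n")
        self._insert(key)


def demo():
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "archive.txt")
        with open(path, "w", encoding="utf-8") as f:
            f.write("youtube abc123\nyoutube def456\n")
        archive = InMemoryArchive(path)
        seen = "youtube abc123" in archive
        new_before = "youtube zzz999" in archive
        archive.record("youtube zzz999")
        new_after = "youtube zzz999" in archive
        with open(path, encoding="utf-8") as f:
            lines = f.read().splitlines()
        return seen, new_before, new_after, lines
```

Because the tree is only populated at startup and on `record`, a second process appending to the same file would not be seen by this instance, which is the synchronization caveat the commit message describes.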