想要去爬取:
中的内容。
点击进去:
http://www.nickjr.tv/shimmer-and-shine/videos/welcome-to-zahramay-falls/
调试看了半天,看到:
地址:
https://cp450888-vh.akamaihd.net/i/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/32/12bfa338c7104e9997a6d989c3304a55/1194501/uoxVgvjvuy_HDSHS201A1_1194501_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil/index_1_av.m3u8?null=0&id=AgBwBECN33f4FZcaq1rIvO0NG+3yWSxT7o5s%2fm7KmLjXX8KF571QMDHDJ32KTtTj6UUqhTgZ8Xf2pw%3d%3d&hdntl=exp=1521249303~acl=%2fi%2fmtvnorigin%2fgsp.alias%2fmediabus%2fkids.com%2f2016%2f05%2f27%2f05%2f15%2f32%2f12bfa338c7104e9997a6d989c3304a55%2f1194501%2fuoxVgvjvuy_HDSHS201A1_1194501_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil%2f*~data=hdntl~hmac=e5031d156c0cdbe593ef6e5b30fee2d53fe1f6f5f80e9570bb38755ab422a755
返回:
Content-Type:
application/vnd.apple.mpegurl
像是音频m3u8
地址:
http://cp450888-vh.akamaihd.net/i/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/32/12bfa338c7104e9997a6d989c3304a55/1194501/uoxVgvjvuy_HDSHS201A1_1194501_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil/segment4_1_av.ts?null=0&id=AgBwBECN33f4FZcaq1rIvO0NG+3yWSxT7o5s%2fm7KmLjXX8KF571QMDHDJ32KTtTj6UUqhTgZ8Xf2pw%3d%3d&hdntl=exp=1521249303~acl=/i/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/32/12bfa338c7104e9997a6d989c3304a55/1194501/uoxVgvjvuy_HDSHS201A1_1194501_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil/*~data=hdntl~hmac=e5031d156c0cdbe593ef6e5b30fee2d53fe1f6f5f80e9570bb38755ab422a755
返回:
Content-Type:video/MP2T
像是视频。
搜:
application/vnd.apple.mpegurl
File Extension | MIME Type |
.M3U8 | application/x-mpegURL or vnd.apple.mpegURL |
.ts | video/MP2T |
判断是否可以用m3u8的js方式|mayloveless’s Blog
“m3u8文件的content-type:安卓只支持 application/vnd.apple.mpegurl,而ios对于application/x-mpegURL,application/vnd.apple.mpegURL均支持。
p.s videojs 关于HLS技术做了许多工具,非常强大,甚至可以把流读过来然后封装成另外的格式”
-》
所以确定:
此处用的技术,好像是videojs的HLS
包括了:.M3U8的音频和.ts的视频。
然后去找找类似于:
<code><video controls width="300" height="270" src="drive.m3u8"></video> <video controls width="300" height="270"><source src="drive.m3u8"> </video> <video type="application/x-mpegurl" controls width="300" height="270"><source src="drive.m3u8"> </video> <video type="application/vnd.apple.mpegurl" controls width="300" height="270"><source src="drive.m3u8"> </video> </code>
的video的标签,看看能否找到。
以为没有看到 <video
但是看到了相关部分的html:
然后发现看到了:
<code><video class="edge-player-content-element" preload="metadata" style="position: absolute; width: 100%; height: 100%" playsinline="" src="blob:http://www.nickjr.tv/113f5810-da55-4585-ab5f-550fe8935d5e"></video> </code>
然后去打开blob地址试试:
blob:http://www.nickjr.tv/113f5810-da55-4585-ab5f-550fe8935d5e
结果打不开。
单独打开:
http://www.nickjr.tv/113f5810-da55-4585-ab5f-550fe8935d5e
结果:
报错:
通过此处的:
<code><div class="video-player pjs edge-player edge-desktop-platform edge-international-platform edge-gui-share-disabled-state edge-gui-content-active-state edge-gui-active-state edge-gui-pause-active-state" data-contenturi="mgid:arc:episode:nickjr.tv:13f67e12-0f5f-11e6-8a89-0026b9414f30" data-video-type="episode" data-item-type="" data-id="13f67e12-0f5f-11e6-8a89-0026b9414f30" data-auth-required="" data-duration="22:30" data-url-key="welcome-to-zahramay-falls" data-serie-url-key="shimmer-and-shine/" data-episode="201" data-season="false" style="width: 100%; height: 100%;" min-width=" 480px 479px" max-width=" 1438px 767px" min-height=" 230px" max-height=" 309px"><video class="edge-player-content-element" preload="metadata" style="position: absolute; width: 100%; height: 100%" playsinline="" src="blob:http://www.nickjr.tv/113f5810-da55-4585-ab5f-550fe8935d5e"></video> </code>
中的:
data-contenturi = mgid:arc:episode:nickjr.tv:13f67e12-0f5f-11e6-8a89-0026b9414f30
和
data-id = 13f67e12-0f5f-11e6-8a89-0026b9414f30
确定这个
data的ID就是:13f67e12-0f5f-11e6-8a89-0026b9414f30
内容是通过
mgid:arc:episode:nickjr.tv:13f67e12-0f5f-11e6-8a89-0026b9414f30
去获取的
然后果然找到了了对应的:
返回:
<code>{"package": { "version": "1.7.1", "video": { "item": [ { "origination_date": "03-15-2018 21:15:05", "rendition": [ { "cdn": "akamai", "method": "hls", "duration": "326", "type": "application/x-mpegURL", "src": "https://cp450888-vh.akamaihd.net/i/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/05/dcbc371f6e59467097ec6bd3a162ba90/1194502/vqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil/master.m3u8?hdnea=st%3D1521162845%7Eexp%3D1521177245%7Eacl%3D%2Fi%2Fmtvnorigin%2Fgsp.alias%2Fmediabus%2Fkids.com%2F2016%2F05%2F27%2F05%2F15%2F05%2Fdcbc371f6e59467097ec6bd3a162ba90%2F1194502%2FvqIve0Tr0p_HDSHS201A2_1194502_6386982_%2C384x216_278%2C512x288_498%2C640x360_1028%2C768x432_1528%2C960x540_2128%2C1280x720_3128%2C1920x1080_5128%2C.mp4.csmil%2F*%7Ehmac%3Dd383f1d909eabecd89f13d275dca13c0bd733cc0189f62f10f4ae5a6a96149ca&__a__=off&__b__=450&__viacc__=NONE", "rdcount": "7", "rdminkbps": "284", "rdmaxkbps": "4546", "rdminwidth": "384", "rdmaxwidth": "1920" } ], "transcript": [ { "kind": "captions", "srclang": "en", "label": "English", "typographic": [ { "format": "cea-608", "src": "https://akadl-a.akamaihd.net/5290/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/05/dcbc371f6e59467097ec6bd3a162ba90/1194502/vqIve0Tr0p_HDSHS201A2_1194502_6386982.scc" }, { "format": "ttml", "src": "https://akadl-a.akamaihd.net/5290/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/05/dcbc371f6e59467097ec6bd3a162ba90/1194502/vqIve0Tr0p_HDSHS201A2_1194502_6386982.dfxp.xml" }, { "format": "vtt", "src": "https://media-utils.mtvnservices.com/caption/convert?mgid=mgid:file:gsp:alias:/mediabus/kids.com/2016/05/27/05/15/05/dcbc371f6e59467097ec6bd3a162ba90/1194502/vqIve0Tr0p_HDSHS201A2_1194502_6386982.dfxp.xml&accountName=intl.mtvi.com" } ] } ] } ] } }} </code>
然后上面写的很清楚:
rendition(n. 译文;演奏;提供;引渡逃奴 -》 估计表示音频或视频?)部分:
cdn用的是akamai
method方法是hls
时长duration是326,应该是秒,则=5分26秒
type类型是application/x-mpegURL 就是之前帖子里提到的 application/x-mpegURL和application/vnd.apple.mpegURL中的第一种
src源是
https://cp450888-vh.akamaihd.net/i/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/05/dcbc371f6e59467097ec6bd3a162ba90/1194502/vqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil/master.m3u8?hdnea=st%3D1521162845%7Eexp%3D1521177245%7Eacl%3D%2Fi%2Fmtvnorigin%2Fgsp.alias%2Fmediabus%2Fkids.com%2F2016%2F05%2F27%2F05%2F15%2F05%2Fdcbc371f6e59467097ec6bd3a162ba90%2F1194502%2FvqIve0Tr0p_HDSHS201A2_1194502_6386982_%2C384x216_278%2C512x288_498%2C640x360_1028%2C768x432_1528%2C960x540_2128%2C1280x720_3128%2C1920x1080_5128%2C.mp4.csmil%2F*%7Ehmac%3Dd383f1d909eabecd89f13d275dca13c0bd733cc0189f62f10f4ae5a6a96149ca&__a__=off&__b__=450&__viacc__=NONE
rdcount什么个数是7?
rdminkbps最小流的比特率284
rdmaxkbps最大是4546
rdminwidth最小宽度384
rdmaxwidth最大宽度1920
transcript(n. 成绩单;抄本,副本;文字记录 -》 应该就是字幕)部分
kind类型是captions(n. 题注,字幕;插图说明(caption的复数形式))
srclang 源的语言是en 英文
label标签也是English英文
typographic(adj. 排字上的;印刷上的)
有3种字幕:
cea-608
ttml
vtt
分别打开是:
分别最后都是5分多种的字幕
然后再去打开上面的:
https://cp450888-vh.akamaihd.net/i/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/05/dcbc371f6e59467097ec6bd3a162ba90/1194502/vqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil/master.m3u8?hdnea=st%3D1521162845%7Eexp%3D1521177245%7Eacl%3D%2Fi%2Fmtvnorigin%2Fgsp.alias%2Fmediabus%2Fkids.com%2F2016%2F05%2F27%2F05%2F15%2F05%2Fdcbc371f6e59467097ec6bd3a162ba90%2F1194502%2FvqIve0Tr0p_HDSHS201A2_1194502_6386982_%2C384x216_278%2C512x288_498%2C640x360_1028%2C768x432_1528%2C960x540_2128%2C1280x720_3128%2C1920x1080_5128%2C.mp4.csmil%2F*%7Ehmac%3Dd383f1d909eabecd89f13d275dca13c0bd733cc0189f62f10f4ae5a6a96149ca&__a__=off&__b__=450&__viacc__=NONE
然后可以直接打开播放视频:
拷贝地址出来
是:
https://cp450888-vh.akamaihd.net/i/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/05/dcbc371f6e59467097ec6bd3a162ba90/1194502/vqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil/master.m3u8?hdnea=st%3D1521162845%7Eexp%3D1521177245%7Eacl%3D%2Fi%2Fmtvnorigin%2Fgsp.alias%2Fmediabus%2Fkids.com%2F2016%2F05%2F27%2F05%2F15%2F05%2Fdcbc371f6e59467097ec6bd3a162ba90%2F1194502%2FvqIve0Tr0p_HDSHS201A2_1194502_6386982_%2C384x216_278%2C512x288_498%2C640x360_1028%2C768x432_1528%2C960x540_2128%2C1280x720_3128%2C1920x1080_5128%2C.mp4.csmil%2F*%7Ehmac%3Dd383f1d909eabecd89f13d275dca13c0bd733cc0189f62f10f4ae5a6a96149ca&__a__=off&__b__=450&__viacc__=NONE
-》所以应该是可以直接下载的
去试试用下载工具,比如迅雷,看看能否下载
结果只能下载个
6KB的:master.m3u8
无法下载完整视频
看看:
下载视频为:
能否保存
结果也是支持尝试去保存为m3u8
预览是:
内容是:
<code>#EXTM3U #EXT-X-STREAM-INF:PROGRAM-ID=1,BANDWIDTH=492000,RESOLUTION=512x288,CODECS="avc1.77.30, mp4a.40.2",CLOSED-CAPTIONS=NONE https://cp450888-vh.akamaihd.net/i/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/05/dcbc371f6e59467097ec6bd3a162ba90/1194502/vqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil/index_1_av.m3u8?null=0&id=AgBwBECN33f4Ffsnq1oZs9WDn8c129amjiijf966sMSOdsZ2hi4fp4N9KddV09vwIF2uupNW%2fYOlfg%3d%3d&hdntl=exp=1521252731~acl=%2fi%2fmtvnorigin%2fgsp.alias%2fmediabus%2fkids.com%2f2016%2f05%2f27%2f05%2f15%2f05%2fdcbc371f6e59467097ec6bd3a162ba90%2f1194502%2fvqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil%2f*~data=hdntl~hmac=99d0090c64e2e6b9254d1aeb49c48447a42f81ce899860779866ac891a6820ad #EXT-X-STREAM-INF:PROGRAM-ID=1,BANDWIDTH=278000,RESOLUTION=384x216,CODECS="avc1.77.30, mp4a.40.2",CLOSED-CAPTIONS=NONE https://cp450888-vh.akamaihd.net/i/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/05/dcbc371f6e59467097ec6bd3a162ba90/1194502/vqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil/index_0_av.m3u8?null=0&id=AgBwBECN33f4Ffsnq1oZs9WDn8c129amjiijf966sMSOdsZ2hi4fp4N9KddV09vwIF2uupNW%2fYOlfg%3d%3d&hdntl=exp=1521252731~acl=%2fi%2fmtvnorigin%2fgsp.alias%2fmediabus%2fkids.com%2f2016%2f05%2f27%2f05%2f15%2f05%2fdcbc371f6e59467097ec6bd3a162ba90%2f1194502%2fvqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil%2f*~data=hdntl~hmac=99d0090c64e2e6b9254d1aeb49c48447a42f81ce899860779866ac891a6820ad #EXT-X-STREAM-INF:PROGRAM-ID=1,BANDWIDTH=937000,RESOLUTION=640x360,CODECS="avc1.77.30, mp4a.40.2",CLOSED-CAPTIONS=NONE https://cp450888-vh.akamaihd.net/i/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/05/dcbc371f6e59467097ec6bd3a162ba90/1194502/vqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil/index_2_av.m3u8?null=0&id=AgBwBECN33f4Ffsnq1oZs9WDn8c129amjiijf966sMSOdsZ2hi4fp4N9KddV09vwIF2uupNW%2fYOlfg%3d%3d&hdntl=exp=1521252731~acl=%2fi%2fmtvnorigin%2fgsp.alias%2fmediabus%2fkids.com%2f2016%2f05%2f27%2f05%2f15%2f05%2fdcbc371f6e59467097ec6bd3a162ba90%2f1194502%2fvqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil%2f*~data=hdntl~hmac=99d0090c64e2e6b9254d1aeb49c48447a42f81ce899860779866ac891a6820ad #EXT-X-STREAM-INF:PROGRAM-ID=1,BANDWIDTH=1263000,RESOLUTION=768x432,CODECS="avc1.77.30, mp4a.40.2",CLOSED-CAPTIONS=NONE https://cp450888-vh.akamaihd.net/i/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/05/dcbc371f6e59467097ec6bd3a162ba90/1194502/vqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil/index_3_av.m3u8?null=0&id=AgBwBECN33f4Ffsnq1oZs9WDn8c129amjiijf966sMSOdsZ2hi4fp4N9KddV09vwIF2uupNW%2fYOlfg%3d%3d&hdntl=exp=1521252731~acl=%2fi%2fmtvnorigin%2fgsp.alias%2fmediabus%2fkids.com%2f2016%2f05%2f27%2f05%2f15%2f05%2fdcbc371f6e59467097ec6bd3a162ba90%2f1194502%2fvqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil%2f*~data=hdntl~hmac=99d0090c64e2e6b9254d1aeb49c48447a42f81ce899860779866ac891a6820ad #EXT-X-STREAM-INF:PROGRAM-ID=1,BANDWIDTH=1822000,RESOLUTION=960x540,CODECS="avc1.77.30, mp4a.40.2",CLOSED-CAPTIONS=NONE https://cp450888-vh.akamaihd.net/i/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/05/dcbc371f6e59467097ec6bd3a162ba90/1194502/vqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil/index_4_av.m3u8?null=0&id=AgBwBECN33f4Ffsnq1oZs9WDn8c129amjiijf966sMSOdsZ2hi4fp4N9KddV09vwIF2uupNW%2fYOlfg%3d%3d&hdntl=exp=1521252731~acl=%2fi%2fmtvnorigin%2fgsp.alias%2fmediabus%2fkids.com%2f2016%2f05%2f27%2f05%2f15%2f05%2fdcbc371f6e59467097ec6bd3a162ba90%2f1194502%2fvqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil%2f*~data=hdntl~hmac=99d0090c64e2e6b9254d1aeb49c48447a42f81ce899860779866ac891a6820ad #EXT-X-STREAM-INF:PROGRAM-ID=1,BANDWIDTH=2655000,RESOLUTION=1280x720,CODECS="avc1.640028, mp4a.40.2",CLOSED-CAPTIONS=NONE https://cp450888-vh.akamaihd.net/i/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/05/dcbc371f6e59467097ec6bd3a162ba90/1194502/vqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil/index_5_av.m3u8?null=0&id=AgBwBECN33f4Ffsnq1oZs9WDn8c129amjiijf966sMSOdsZ2hi4fp4N9KddV09vwIF2uupNW%2fYOlfg%3d%3d&hdntl=exp=1521252731~acl=%2fi%2fmtvnorigin%2fgsp.alias%2fmediabus%2fkids.com%2f2016%2f05%2f27%2f05%2f15%2f05%2fdcbc371f6e59467097ec6bd3a162ba90%2f1194502%2fvqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil%2f*~data=hdntl~hmac=99d0090c64e2e6b9254d1aeb49c48447a42f81ce899860779866ac891a6820ad #EXT-X-STREAM-INF:PROGRAM-ID=1,BANDWIDTH=4527000,RESOLUTION=1920x1080,CODECS="avc1.640028, mp4a.40.2",CLOSED-CAPTIONS=NONE https://cp450888-vh.akamaihd.net/i/mtvnorigin/gsp.alias/mediabus/kids.com/2016/05/27/05/15/05/dcbc371f6e59467097ec6bd3a162ba90/1194502/vqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil/index_6_av.m3u8?null=0&id=AgBwBECN33f4Ffsnq1oZs9WDn8c129amjiijf966sMSOdsZ2hi4fp4N9KddV09vwIF2uupNW%2fYOlfg%3d%3d&hdntl=exp=1521252731~acl=%2fi%2fmtvnorigin%2fgsp.alias%2fmediabus%2fkids.com%2f2016%2f05%2f27%2f05%2f15%2f05%2fdcbc371f6e59467097ec6bd3a162ba90%2f1194502%2fvqIve0Tr0p_HDSHS201A2_1194502_6386982_,384x216_278,512x288_498,640x360_1028,768x432_1528,960x540_2128,1280x720_3128,1920x1080_5128,.mp4.csmil%2f*~data=hdntl~hmac=99d0090c64e2e6b9254d1aeb49c48447a42f81ce899860779866ac891a6820ad </code>
播放列表
然后再去想办法:
转载请注明:在路上 » 【已解决】爬取nickjr.com中的动画片相关视频字幕等数据