-
Notifications
You must be signed in to change notification settings - Fork 72
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
最后一页请求不到: [WARN] [get_posts] 'NoneType' object has no attribute 'group'. args=(2428664072, 6) kwargs={'rn': 30, 'with_comments': True} #207
Comments
就是这些内容。无法使用get_post 获取到。 |
我们月中要开MPEG,巨忙,可能得过两个星期才能修了 |
ce8964e 看看能否解决问题 |
$ curl -s https://n0099.net/tbm/v1/client_tester.php\?type\=replies\&tid\=2428664072\&pn\=6\&client_version\=12.62.1.0 \
| jq '.post_list[] | select(.id == 37592674736) | .content' [
{
"cdn_src_active": "http://c.tieba.baidu.com/c/p/img?src=http://static.tieba.baidu.com/tb/editor/images/qw_cat_small/qw_cat_0004.gif&width=720&height=720&qulity=60&first_gif=1&imgtype=0",
"type": 3,
"bsize": "200,200",
"cdn_src": "http://c.tieba.baidu.com/c/p/img?src=http://static.tieba.baidu.com/tb/editor/images/qw_cat_small/qw_cat_0004.gif&width=720&height=720&qulity=60&first_gif=1&imgtype=0",
"big_cdn_src": "http://c.tieba.baidu.com/c/p/img?src=http://static.tieba.baidu.com/tb/editor/images/qw_cat_small/qw_cat_0004.gif&width=1920&height=640&qulity=100&first_gif=0&imgtype=0",
"is_long_pic": 0,
"show_original_btn": 1
}
] https://tieba.baidu.com/p/2428664072?pid=37592674736#37592674736 |
还有点小错误。 第 108 楼 post 的图片分块没有 图片源地址。FragImage_p(origin_size=0, show_width=200, show_height=200, hash='') Post(contents=Contents_p(objs=[FragImage_p(origin_size=0, show_width=200, show_height=200, hash='')]), sign='', comments=[Comment_p(contents=Contents_pc(objs=[FragText(text='谢了\u3000\u3000\u3000\u3000\u3000\u3000\u3000~~~NDS游戏扯淡群 326448723 \u3000\u3000\u3000\u3000\u3000\u3000\u3000\u3000\u3000\u3000')]), fid=500600, fname='nds', tid=2428664072, ppid=37592674736, pid=37593569606, user=UserInfo_p(user_id=418853149, portrait='tb.1.3046e996.ZynDfFdUIQvqbX5YGTPWjA', user_name='8x7y10z4w', nick_name_new='白骨君♀', level=13, glevel=5, gender=<Gender.MALE: 1>, ip='上海', icons=[], is_bawu=False, is_vip=False, is_god=False, priv_like=<PrivLike.HIDE: 3>, priv_reply=<PrivReply.ALL: 1>), author_id=418853149, reply_to_id=0, floor=108, agree=0, disagree=0, create_time=1376991532, is_thread_author=True)], is_aimeme=False, fid=500600, fname='nds', tid=2428664072, pid=37592674736, user=UserInfo_p(user_id=419118212, portrait='tb.1.78172449.txaW3Z8FW70ErVhXYyVqqA', user_name='xky231314', nick_name_new='xky231314', level=8, glevel=0, gender=<Gender.MALE: 1>, ip='', icons=[], is_bawu=False, is_vip=False, is_god=False, priv_like=<PrivLike.PUBLIC: 1>, priv_reply=<PrivReply.ALL: 1>), author_id=419118212, vimage=VirtualImage(enabled=False, state=''), floor=108, reply_num=1, agree=0, disagree=0, create_time=1376990491, is_thread_author=False) |
#207 (comment) |
@n0099 我知道了。但是这个图片的分块没有任何标识。如果在其他帖子里遇到了类似这个的没有任何标识的图片分块。可以认为这个图片分块就是 qw_cat_small/qw_cat_0004.gif 这个表情吗 |
否
我记得预设的有9张图,这些表情包还允许自定义气泡中的文本(应该是生成的普通图片从而有 |
@n0099 好的,最后一个问题。 |
再读一遍 #207 (comment) 别
|
我的问题可能很可笑,我很抱歉。
https://github.com/n0099/open-tbm/blob/f2388a78d64ed1123be762f0c212a4f0847a4fbe/c%23/crawler/src/Tieba/Crawl/Parser/Post/ReplyParser.cs#L54-L76 实际上我很早就看过了,它似乎只是将URL 简化为仅保留文件名。我没有在这块代码中找到获取表情的方法,所以继续问了很多问题。如果造成了麻烦,我真的很抱歉。 |
这种没法靠hash区分的罕见情况可能得手动导出url了 |
好的,谢谢你们的帮助! |
是
导致您拿不到
这本质上跟您上次问的 #202 之底层逻辑完全相同
我想说的是
这类有损
|
简要描述这个bug
只请求至第5页(102楼),最后一页 (第六页) 的数据请求不到。
如何复现
tid: 2428664072
pn: 6
<2024-07-06 17:16:58.489> [WARN] [get_posts] 'NoneType' object has no attribute 'group'. args=(2428664072, 6) kwargs={'rn': 30, 'with_comments': True}
你希望程序作出何种行为
...
截图(可选)
...
The text was updated successfully, but these errors were encountered: