Logging in to Sina Weibo from Java with HttpClient 4.x

    m_z  Sep 17, 2012  7383 views
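    For work I recently needed to crawl posts from Sina Weibo's Hall of Fame (名人堂). I first built this on the SDK that Sina provides, but the customer's network requires going through a proxy, and calls to the SDK fail in that environment. The error is essentially the same as the one in this forum thread: http://forum.open.weibo.com/read.php?tid=48600. As in that thread, nobody has posted a solution so far, so I now plan to crawl directly with HttpClient 4.x.

    I'd appreciate it if anyone with experience could describe the login process.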
    14 replies
    twm
        1
       Sep 17, 2012
    Without the Sina API this will be hard to pull off.
    explon
        2
       Sep 17, 2012
    Without the SDK you'll wear yourself out.
    holmesabc
        3
       Sep 17, 2012
    Wouldn't it be better to modify the SDK's source code yourself?
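    m_z
        4
    OP
       Sep 17, 2012
    @twm Indeed.

    @explon Agreed.

    @holmesabc I did consider that, but the SDK's source is itself implemented with HttpClient, and its proxy support also just sets the proxy on HttpClient. What approach did you have in mind for changing the source?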
    cooiky
        5
       Sep 18, 2012
    OAuth
    sohoer
        6
       Sep 18, 2012
    Attach the cookie you get after logging in, and it should be quite simple.
    m_z
        7
    OP
       Sep 18, 2012
    @sohoer Do you mean taking the cookie from a successful manual login?

    @cooiky What approach do you have in mind?
    cooiky
        8
       Sep 18, 2012
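    zoran
        9
       Sep 18, 2012
    Option 1: rewrite the HttpClient part of the SDK's code. If the old version of HttpClient can't handle it, try replacing it with HttpComponents.
    Option 2: this is a bit more work, and if you crawl too much or too fast your account and IP will get banned. The usual approach is to use the WAP version, weibo.cn, as the entry point: simulate the login, then set the cookie you get back, and you won't need to log in again for a while. Then start crawling.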
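
    The cookie-reuse step of the second approach above (log in once, then keep sending the returned cookie) could be sketched roughly as follows. This is only an illustration under stated assumptions: it uses the JDK's built-in java.net.CookieManager rather than HttpClient 4.x, the cookie name "sid" and its value are hypothetical, and the actual login request to weibo.cn (URL, form fields) is not shown because the thread does not describe it.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.net.CookieManager;
import java.net.CookiePolicy;
import java.net.URI;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class CookieReuseDemo {

    // Stores the Set-Cookie headers of a (hypothetical) login response and
    // returns the Cookie header value to send on later requests to the same site.
    static String cookieHeaderAfterLogin(URI site, List<String> setCookieHeaders) {
        // A null store means the JDK's default in-memory cookie store.
        CookieManager manager = new CookieManager(null, CookiePolicy.ACCEPT_ALL);
        try {
            // Record the cookies exactly as the login response delivered them.
            manager.put(site, Collections.singletonMap("Set-Cookie", setCookieHeaders));
            // Ask which cookies apply to a follow-up request to the same URI.
            Map<String, List<String>> headers =
                    manager.get(site, new HashMap<String, List<String>>());
            List<String> cookies = headers.get("Cookie");
            return cookies == null ? "" : String.join("; ", cookies);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // "sid=abc123" is a placeholder, not a real Weibo cookie.
        String header = cookieHeaderAfterLogin(
                URI.create("http://weibo.cn/"),
                Collections.singletonList("sid=abc123; Path=/"));
        System.out.println("Cookie: " + header);
    }
}
```

    The returned string would then be sent as the Cookie request header on each crawl request until the session expires, at which point the login has to be repeated.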
    m_z
        10
    OP
       Sep 19, 2012
    @zoran Option 1 seems more reliable to me. Thanks!
    hetaoblog
        11
       Sep 19, 2012
    Set the JVM proxy before calling the SDK.
    m_z
        12
    OP
       Sep 19, 2012
    @hetaoblog How does that work?!
    hetaoblog
        13
       Sep 19, 2012
    @m_z

    http://docs.oracle.com/javase/6/docs/technotes/guides/net/proxies.html

    In short, if the customer uses an HTTP proxy, applying the following settings before calling the SDK should work:
    System.setProperty("http.proxyHost", "webcache.mydomain.com");
    System.setProperty("http.proxyPort", "8080");
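
    To see what these properties do, here is a small sketch that asks the JVM's default ProxySelector which proxy it would pick for a given URI once the properties are set. The class and method names (JvmProxyDemo, proxyFor) are made up for illustration, and example.com is a placeholder target. Note that this only shows the behavior of the JDK's own networking layer; whether a particular SDK or HTTP library consults these properties depends on that library, so treat it as something to verify rather than a guarantee.

```java
import java.net.InetSocketAddress;
import java.net.Proxy;
import java.net.ProxySelector;
import java.net.URI;
import java.util.List;

final class JvmProxyDemo {

    // Returns "host:port" of the first proxy the default ProxySelector
    // chooses for the URI, or "DIRECT" when no proxy applies.
    static String proxyFor(String uri) {
        List<Proxy> proxies = ProxySelector.getDefault().select(URI.create(uri));
        Proxy first = proxies.get(0);
        if (first.type() == Proxy.Type.DIRECT) {
            return "DIRECT";
        }
        InetSocketAddress address = (InetSocketAddress) first.address();
        // getHostString() avoids a DNS lookup of the proxy host.
        return address.getHostString() + ":" + address.getPort();
    }

    public static void main(String[] args) {
        // Same settings as in the reply above; the host name is the
        // placeholder used in Oracle's proxy guide.
        System.setProperty("http.proxyHost", "webcache.mydomain.com");
        System.setProperty("http.proxyPort", "8080");

        System.out.println("http  -> " + proxyFor("http://example.com/"));
        System.out.println("https -> " + proxyFor("https://example.com/"));
    }
}
```

    The http.* properties only cover plain http URIs. Requests to https URIs are governed by the separate https.proxyHost and https.proxyPort properties, so both pairs need to be set if the SDK talks to HTTPS endpoints.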
    m_z
        14
    OP
       Sep 19, 2012
    @hetaoblog I learned something new. Thanks!