
I'm writing a script in Python that will scrape some pages from my web server and put them in a file. I'm using the mechanize module's Browser() class for this particular task.

However, I've found that fetching the pages one at a time through a single mechanize.Browser() instance is rather slow. Is there a way I could relatively painlessly use multithreading/multiprocessing (i.e. issue several GET requests at once)?

  • Have you looked at the Python threading module? Commented Oct 20, 2011 at 5:10
  • Isn't the threading module only for starting a new CPU thread? Commented Oct 20, 2011 at 5:12
  • Related: stackoverflow.com/questions/4119680/… and stackoverflow.com/questions/4139988/… and stackoverflow.com/questions/6905800/… Commented Oct 20, 2011 at 5:17
  • well, if you don't want to use threading as @ObscureRobot suggested, you can try multiprocessing. Commented Oct 20, 2011 at 5:30
  • ObscureRobot and imm: I don't want CPU threads. As my post says, I want "[to] issue several GET requests at once" - as in HTTP GET requests. @phaedrus - thanks, those are an interesting read. They don't seem very easy to implement, though; it looks like I'd have to rewrite the entire app (over 3000 lines of code). Commented Oct 20, 2011 at 5:53
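
For what it's worth, the threading module suggested in the comments does help here despite the GIL, because the threads spend almost all their time blocked on network I/O rather than on the CPU. A minimal, hypothetical sketch (the URLs are placeholders, and each thread gets its own Browser, since a mechanize.Browser instance is not thread-safe):

    import threading
    import mechanize

    # Placeholder list of pages on the server to scrape.
    urls = [
        "http://example.com/page1",
        "http://example.com/page2",
        "http://example.com/page3",
    ]

    results = {}
    results_lock = threading.Lock()

    def fetch(url):
        # One Browser per thread; the GIL is released while the socket
        # waits for the response, so the fetches overlap in time.
        browser = mechanize.Browser()
        browser.set_handle_robots(False)
        body = browser.open(url).read()
        with results_lock:
            results[url] = body

    threads = [threading.Thread(target=fetch, args=(url,)) for url in urls]
    for t in threads:
        t.start()
    for t in threads:
        t.join()

    for url, body in results.items():
        print("%s: %d bytes" % (url, len(body)))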

2 Answers


Use gevent or eventlet to get concurrent network IO.
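
Not the answerer's code, but a minimal sketch of the gevent approach, assuming the fetches stay in mechanize (URLs are placeholders): monkey-patching makes the blocking socket calls cooperative, so several greenlets can have GET requests in flight at once with little change to existing code.

    # gevent's monkey patching must run before anything else imports sockets.
    from gevent import monkey
    monkey.patch_all()

    import gevent
    import mechanize

    # Placeholder URLs.
    urls = [
        "http://example.com/page1",
        "http://example.com/page2",
        "http://example.com/page3",
    ]

    def fetch(url):
        # One Browser per greenlet; the patched sockets yield to the gevent
        # hub while waiting on the response, so downloads run concurrently.
        browser = mechanize.Browser()
        browser.set_handle_robots(False)
        return url, browser.open(url).read()

    jobs = [gevent.spawn(fetch, url) for url in urls]
    gevent.joinall(jobs)

    for job in jobs:
        url, body = job.value
        print("%s: %d bytes" % (url, len(body)))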


If you want industrial-strength Python web scraping, check out scrapy. It uses Twisted for async comms and is blindingly fast. Being able to spider through 50 pages per second isn't an unrealistic expectation.
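
A minimal, hypothetical spider to illustrate (the URLs and file-naming scheme are placeholders, not part of the answer); Scrapy schedules the requests through Twisted, so the pages are downloaded concurrently:

    import scrapy

    class PagesSpider(scrapy.Spider):
        name = "pages"
        # Placeholder starting URLs on the server being scraped.
        start_urls = [
            "http://example.com/page1",
            "http://example.com/page2",
        ]

        def parse(self, response):
            # Save each downloaded page to a file named after the URL path.
            filename = response.url.rstrip("/").split("/")[-1] or "index"
            with open(filename + ".html", "wb") as f:
                f.write(response.body)

It can be run without setting up a full project via: scrapy runspider pages_spider.py (the file name here is arbitrary).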
