Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
351 views
in Technique[技术] by (71.8m points)

java - Selenium jSoup get data from Javascript Webpage

Have asked a few questions around this recently, but haven't really found what I'm looking for.

I am trying to get all of the matches from http://www.futbol24.com/Live/?__igp=1&LiveDate=20141106 to print out, with time, home team and away team. I understand the content is loaded after the page is.

I have been told to use Selenium and then use jSoup on the result to get the data I want. Does anybody have a tutorial or some sample code they could show me, for how to do it on the website above?

Any examples would be greatly appreciated, thanks

See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Answer

0 votes
by (71.8m points)

If you are going to scrape / datamine someone's site, here are some considerations:

  1. Get permission from the site's owner! If you do not, you will piss off the owner and get blacklisted in the best case, or be served with a lawsuit in the worst case.
  2. Find out if the site exposes an . This is always the better way of scraping a site.
  3. Research tools / libraries that are more appropriate for this task. Some of these include , , , ..... Depending on your level of comfort / knowledge, you may need to research the underlying technologies: , , .....
  4. is a functional test library for browser applications, which makes it a poor choice for this task.

PS: I am fully expecting for this to get downvoted / closed, because discussions / opinions are off-topic for SO.


与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...