If We want to parse HTML in java then JSOUP provide that facility to us.. It provides a API for extracting and manipulating data from web page.By using JSOUP we can select element,traverse document,manipulate html element etc In java code.
Below is example in which we connect to www.google.com and select title, all div text in java code and print them.
package com.kodemaker;
package com.kodemaker;
import java.io.IOException; import org.jsoup.Jsoup; import org.jsoup.nodes.Document; import org.jsoup.nodes.Element; import org.jsoup.select.Elements; public class JSOUPTest { public static void main(String[] args) { try{ Document doc = Jsoup.connect("http://www.google.com").get(); System.out.println("text : " + doc.select("title").text()); Elements divs = doc.select("div"); for (Element div : divs){ System.out.println("text : " + div.text()); } }catch (IOException e){ e.printStackTrace(); } } }
Above programme will display title of google page i.e. google and display all div text on console.
No comments:
Post a Comment