Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Won't Fix
-
None
-
None
-
None
Description
The SimplePostTool fails to grab web pages in simple cases.
The getLinksFromWebPage process fails to detect url within the html page in line 1252. Seams to be a problem when the html page is not perfect, from the xml point of view.
Example to reproduce the problem :
java -Dc=techproducts -Ddata=web -Drecursive=3 -jar example\exampledocs\post.jar http://www.google.com/