Details
-
Improvement
-
Status: Open
-
Minor
-
Resolution: Unresolved
-
6.5.1
-
Indexing websites using URL, recurvice and depth ( i.e. in DATA_WEB_MODE )
Description
When Indexing is done using SimplePostTool or Using SolrJ or any means, with data source as a URL. If fetched HTML page contains <script> and <style> tags inside <body> tag (not in <head> tag ) then after Posting document to Solr collection using "sample_techproducts_configs" configuration, the innerText ( i.e. EMAC/JS scripts and CSS styles) remains as part of document text inside the "content"/"text" field in Index documents.