solr - How to index all metatags in Nutch
I have installed Nutch 1.9.9 and configured it to crawl successfully with Solr 4.10.1. I am trying to set up the index-metadata plugin in Nutch as defined by
How can I set it up to index all the metadata on a site? I see that the value for metatags.names is *, as in:
    <property>
      <name>metatags.names</name>
      <value>*</value>
      <description>Names of the metatags to extract, separated by ','. Use '*' to extract all metatags. Prefixes the names with 'metatag.' in the parse-metadata. For instance, to index description and keywords, you need to activate the plugin index-metadata and set the value of the parameter 'index.parse.md' to 'metatag.description,metatag.keywords'.</description>
    </property>
But I am unsure how to set the value of index.parse.md without listing the individual metatag names. I tried:

    <property>
      <name>index.parse.md</name>
      <value>meta*</value>
      <description>Comma-separated list of keys to be taken from the parse metadata to generate fields. Can be used e.g. for 'description' or 'keywords' provided that these values are generated by a parser (see parse-metatags plugin).</description>
    </property>

However, none of the metadata is indexed when I run

    bin/nutch indexchecker http://nutch.apache.org/

and I'm sure the page has metadata, because the parsed metadata is shown when I run

    bin/nutch parsechecker http://nutch.apache.org/
Any help would be greatly appreciated!
The index-metadata plugin does not work this way; you must specify the full name of each field, e.g. "metatag.keywords".
Also, for "metatags.names" the value "*" is not actually a wildcard, so you cannot use something like "meta*".
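
As a rough sketch of what this means in practice, the two properties might be set as below. It assumes the overrides go in nutch-site.xml and that only the description and keywords metatags are wanted. Those two names are taken from the property description quoted in the question, and any other metatag would have to be added to the list by its full name. It also assumes the parse-metatags and index-metadata plugins are already enabled in plugin.includes.

```xml
<!-- "*" here is a special value meaning "extract every metatag";
     each one is stored in the parse metadata with the "metatag." prefix -->
<property>
  <name>metatags.names</name>
  <value>*</value>
</property>

<!-- No wildcard support here: every field to index is listed by its full name.
     The two names below are illustrative, not a complete list. -->
<property>
  <name>index.parse.md</name>
  <value>metatag.description,metatag.keywords</value>
</property>
```

Running bin/nutch parsechecker on a sample page, as in the question, is one way to see which metatag.* keys are actually present before adding them to index.parse.md.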