solr - how to index all metatags in nutch -


I have installed the 1.9.9 and have configured it to crawl successfully with Solar 4.10.1. I am trying to define index metadata in Nutch as defined by

How can I set it in an index of all metadata on a site? I like the values ​​for metatags.names * like

< Pre class = "snippet-code-html lang-html prettyprint-override"> & lt; Property & gt; & Lt; Name & gt; Metatags.names & lt; / Name & gt; & Lt; Price & gt; * & Lt; / Pricing & gt; & Lt; Details & gt; To remove metatag names, separated by ','. Use '*' to remove all metatags, prefix names with 'metatags'. For index details and keywords in parse-metadata, for example, you must activate the plugin index-metadata and set the parameter 'index.parse.md' to 'Metatag Desassembly, Metatag .keyword'. & Lt; / Description & gt; & Lt; / Property & gt;

But I am unsure about setting the value of index.parse.md without listing individual metatog names. I tried it

  & lt; Property & gt; & Lt; Name & gt; Index.parse.md & lt; / Name & gt; & Lt; Price & gt; Meta * & lt; / Pricing & gt; To create a field from parse metadata & lt; Description & gt; A comma-separated list of keys can be used for example 'description' or 'keyword' provided that these values ​​have been created by the parser (see parse-metatag plugins) & lt; / Description & gt;          / P> 

Snippet "data-lang =" js "data-hide =" false ">

  bin / nutch indexchecker http://nutch.apache.org/  

and I'm sure the metadata Because it gives the metadata parsed when

  bin / nutch parsechecker http://nutch.apache.org/  
< / P>

Any help would be greatly appreciated! The

The plugin index-metadata does not work in this way, you must specify the full name, e.g. "Metatag.keywords".

In addition to "metatags.names" the value "" is not actually wildcard, you can not do anything like "Meta ".


Comments

Popular posts from this blog

winforms - C# Form - Property Change -

javascript - amcharts makechart not working -

java - Algorithm negotiation fail SSH in Jenkins -