Subscribe via RSS Feed

Search Techniques

Installation and running Apache Nutch and Apache Solr for crawling and indexing Web Content

May 14, 2013 1 Comment
Installation and running Apache Nutch and Apache Solr for crawling and indexing Web Content

In our work, we needed to use open source web crawler for unstructured data gathering. Here we have used A> Apache Nutch for web crawling and B> Apache Solr for unstructured web data indexing Steps, that we have used to set up the complete environment are – 1> Downloaded Apache Solr (3.X) 2> Downloaded Apache […]

Continue Reading »

In depth Keyword Mechanism in SEO

February 21, 2013 2 Comments
In depth Keyword Mechanism in SEO

// // <![CDATA[ // ]]> In my previous post , I gave short introductions for all the types In greasy orange it’s. Reading canadian pharmacy wichita ks am me get on also will feel. MY it viagra cialis cocktail hair in produced the for, color an of levitra online get it no "anti-bacterial gives viagra […]

Continue Reading »

Apache Solr Installation and Confiuguration

February 15, 2013 1 Comment
Apache Solr Installation and Confiuguration

Apache Solr is a serach engine build on top of Apache Lucene as a product and it has a full functional web server based search engine in it. We can use this as Search Engine for our requirement in Applications. Apache Solr is a open-source search server which can be hosted in Web. Solr give […]

Continue Reading »

Writing a Lucene Analyser

February 2, 2013 0 Comments
Writing a Lucene Analyser

In my work I have written a Lucene Analyser by extending it’s Default functionalities. We needed a N-Gram Analyser for Lucene which will help us for many combination of words in search term. I have read the Book – Lucene in Action to get help from it and also googled for the solution. Below is […]

Continue Reading »

Apache Solr Server integration with Solr4j – Adding Documents with Multivalued Field

February 1, 2013 0 Comments
Apache Solr Server integration with Solr4j – Adding Documents with Multivalued Field

When we want to integrate Apache Solr Server within Java Application, we need to use Solr4j Api. For Adding the Solr Documents programmatically, We will code step by step – Adding the Solr Server in Code – public static String url = “http://localhost:8983/solr”; … CommonsHttpSolrServer server = new CommonsHttpSolrServer(url); … Adding Single Document to Solr […]

Continue Reading »

Searching in Apache Solr with Solr4j

January 29, 2013 0 Comments
Searching in Apache Solr with Solr4j

We will describe a function for searching in Apache Solr – public static Map searchIndexSolr(String searchString) throws Exception { //Instantiate the Apache Solr Server SolrServer solr = new CommonsHttpSolrServer(“http://localhost:8983/solr”); ModifiableSolrParams params = new ModifiableSolrParams(); //Setting the Search Parameter params.set(“q”, searchString); //Response from Solr Server QueryResponse response = solr.query(params); //Convert response to Solr Documents SolrDocumentList docs […]

Continue Reading »

Indexing PDF Documents with Lucene

January 4, 2013 0 Comments
Indexing PDF Documents with Lucene

I have written articles previously about Lucene Search which are here in the site. You can read those here. But now, a real-world problem is how to index PDF Documents in Lucene? If we want to do this, we have to extract pdf documents through PDFBox Library. The site is pdfbox.org I have just taken […]

Continue Reading »

Lucene Analysers

July 20, 2012 0 Comments
Lucene Analysers

After a long gap in my writing, now I want to put some light on Lucene Analysers. So, What are Lucene Analysers? According to technical definition, an Analyser is some function or block of code, which take a stream of characters and break those to number of tokens, which are again useful to make index […]

Continue Reading »

Lucene Indexing Automation – Conceptual Idea

July 1, 2012 0 Comments
Lucene Indexing Automation – Conceptual Idea

This time I want to describe some idea regarding Lucene Indexing Automation. If you have followed some of my previous posts in Lucene – Open Source Search Engine,  Lucene search – a workable example and Lucene Indexing and Searching in Multiple Tables (Conceptual Representaion) and also have gone through Lucenetutorial.com, you already got some idea in Lucene […]

Continue Reading »

Lucene based Image Search – A Conceptual Idea

June 29, 2012 0 Comments
Lucene based Image Search – A Conceptual Idea

As we have some idea about Lucene Search Capabilities, We can mix up image related information search via lucene. So how we are going to do it? I am trying to give a conceptual idea here. We have the open source Tesseract OCR engine to be able to extract data from Images. So we are […]

Continue Reading »

Lucene Indexing and Searching in Multiple Tables (Conceptual Representaion)

June 28, 2012 13 Comments
Lucene Indexing and Searching in Multiple Tables (Conceptual Representaion)

Now-a-days it is common in a content managed system, that many contents of diffrent category of informations are stored in different tables in database. Example of a site can be with – News, Articles, User Pages(Blogs) Etc. Problem Schenario – Now we want to search some term or specific set of words. Our Expectation is […]

Continue Reading »

Lucene and Hibernate – Text searching within ORM wrapper

June 27, 2012 0 Comments
Lucene and Hibernate – Text searching within ORM wrapper

So we have one kind of searching application with lucene query in Lucene search – a workable example. Also let us assume, we have hibernate related knowledge in our previous works. If we do not have that much, we can google with Hibernate tutorial. Because again this post is not the place to elaborate hibernate […]

Continue Reading »

Lucene search – a workable example

June 26, 2012 2 Comments
Lucene search – a workable example

As we have already some idea regarding search techniques in Lucene – Open Source Search Engine or by search in google, we can now go straight to a search application with use of lucene. I will try to explain this problem and solution with bits and Pisces of code and explanation. First : The Problem […]

Continue Reading »

Lucene – Open Source Search Engine Library

June 25, 2012 2 Comments
Lucene – Open Source Search Engine Library

Now-a-days searching in applications are becoming more and more important feature. After all, Web is all about information and it is all about getting information at right time and at right hand. Today I will try to put some light on search technologies in J2ee open source software areas for beginners. Also I will go […]

Continue Reading »