Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-1358

Integration of Tika and DataImportHandler

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.5
    • None

    Description

      At the moment, it's impossible to configure Solr such that it build up documents by using data that comes from both pdf documents and database table columns. Currently, to accomplish this task, it's up to the user to add some preprocessing that converts pdf files into plain text files. Therefore, I would like to see an integration of Solr Cell into DIH that makes those preprocessing obsolete.

      Attachments

        1. SOLR-1358.patch
          20 kB
          Akshay K. Ukey
        2. SOLR-1358.patch
          7 kB
          Akshay K. Ukey
        3. SOLR-1358.patch
          7 kB
          Noble Paul
        4. SOLR-1358.patch
          7 kB
          Akshay K. Ukey

        Issue Links

          Activity

            People

              noble.paul Noble Paul
              szott Sascha Szott
              Votes:
              2 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: