next → ← prev

Tika Introduction

Tika is a content analysis tool, designed and developed by Apache Software Foundation. It is written in Java and used to detect and extract content and metadata from the file.

It supports thousand of file types including .XML, XLS, PDF etc.

It is cross-platform and it's repository is available at github for public access.

History

In 2007, Apache started a project to develop a tool that can extract the content from the file of any type. The prime purpose was to make it more usable with CMS (Content Management System) and Web crawlers. And in 2011, first official version 1.0 was released.

The current stable version of Tika is 1.17, released on December 13, 2017.

Popularity

Tika is used by world wide and top giants are using it for information retrieval. There are most well known companies that use Tika.

FICO (Fair Issac Corporation)
Goldman Sachs
NASA
Drupal (software)
Alfresco (software)

Forbes Magazine published a report on the key role of Tika that was used by 400 journalist to extract 11.5 million documents to get information.

Next TopicTika Features

← prev next →

For Videos Join Our Youtube Channel: Join Now

Feedback

Send your Feedback to [email protected]

Help Others, Please Share

Learn Latest Tutorials

Splunk tutorial

Splunk

SPSS

Swagger tutorial

Swagger

Transact-SQL

Tumblr tutorial

Tumblr

ReactJS

Regex

Reinforcement learning tutorial

Reinforcement Learning

R Programming tutorial

R Programming

RxJS

React Native tutorial

React Native

Python Design Patterns

Python Design Patterns

Python Pillow tutorial

Python Pillow

Python Turtle tutorial

Python Turtle

Keras

Preparation

Aptitude

Logical Reasoning

Reasoning

Verbal Ability

Interview Questions

Company Interview Questions

Company Questions

Trending Technologies

Artificial Intelligence

Artificial Intelligence

AWS

Selenium tutorial

Selenium

Cloud Computing

Cloud Computing

Hadoop tutorial

Hadoop

ReactJS Tutorial

ReactJS

Data Science Tutorial

Data Science

Angular 7 Tutorial

Angular 7

Blockchain Tutorial

Blockchain

Git

Machine Learning Tutorial

Machine Learning

DevOps Tutorial

DevOps

B.Tech / MCA

DBMS

Data Structures tutorial

Data Structures

DAA

Operating System

Operating System

Computer Network tutorial

Computer Network

Compiler Design tutorial

Compiler Design

Computer Organization and Architecture

Computer Organization

Discrete Mathematics Tutorial

Discrete Mathematics

Ethical Hacking

Ethical Hacking

Computer Graphics Tutorial

Computer Graphics

Software Engineering

Software Engineering

Web Technology

Cyber Security tutorial

Cyber Security

Automata Tutorial

Automata

C Language tutorial

C Programming

C++

Java

.Net Framework tutorial

.Net

Python tutorial

Python

List of Programs

Programs

Control Systems tutorial

Control System

Data Mining Tutorial

Data Mining

Data Warehouse Tutorial

Data Warehouse

^{Like/Subscribe us for latest updates or newsletter}

Subscribe to Get Email Alerts

YouTube