Introduction
The developed tool is a Java application. It can run on any platform supporting Java Run time Environment (JRE) Framework. The developed tool extracts text from password protected Pdf documents and can store the required information in database. (We have developed this tool with the feature of extracting an individuals information from E-Aadhar and storing it in Microsoft Access database.
Therefore, the main objectives of PET are as follows:
- Extraction of text and images: It allows to extract text and images from password protected Pdf documents.
- Storing data in database: It stores the data extracted in database (Microsoft Access) which creates automatically as user clicks save button.
Overview of PET
The proposed and developed PET is a tool that allows user to extract the desired information and images from Pdf documents (with or without password) and stores it in database. The user can also retrieve the desired information from the database. We have developed this in context of extracting an individuals' information from E-Aadhar (protected with password) and storing it in Access Database.
Operations of PET
As the user installs the application, the following interface will appear:
Fig. i. Main interface of application
The main operations of PET involve the following steps:
- Selecting the Pdf file: As the user clicks on File button, a dialog box will open allowing the user to select the file. (Fig. ii)
Fig. ii. Selecting a Pdf file to open
The user will have to provide the password if the file is password protected. In our case, the password is Pin-Code of the user. (Fig. iii)
Fig. iii. Entering credentials
- Digitization of Information from Pdf: As the file is uploaded, the application will extract the desired information from the file and will show it in the new window.
In our case, we are extracting the information of an individual from E-Aadhar as shown in fig. iv.
Fig. iv. Displaying Extracted information from Pdf
- Storing the information in Database: If the user wants to save the data, he can click on save button, as the user clicks on save button, a database will be created in the folder (Aadhar Data in our case).
Fig. v. Location where database is created. (C:\Aadhardata)
- Retrieving the data from Database: If the user wants to retrieve the information, he can fill in the Unique key value to get the desired data, we have taken the Aadhar no. of user as the Key value. As in Fig vi, user inputs the value and the desired data is fetched from the database (Fig. vii).
Fig. vi