LIS Links

First and Largest Academic Social Network of LIS Professionals in India

PDF Metadata Extractor Information Needed

Dear Friends

We need information about a PDF metadata extractor, preferably free and usable offline. Please share details if anyone is using one. It is to be used in combination with DSpace, but we cannot put our collection online.

Any help will be highly appreciated.

Thank you

Subeesh A C


Replies to This Forum

Permalink Reply by Baskar Selvaraj on February 16, 2016 at 15:09

Try ExifTool

http://www.sno.phy.queensu.ca/~phil/exiftool/

I have been using it to extract metadata from PDFs for use in DSpace. It can extract metadata from all PDFs in one go if you are familiar with its command-line options.

S. Baskar


Permalink Reply by Subeesh A C on February 17, 2016 at 0:25

Thank you very much sir

But when I tried it, the tool seemed to extract data only from the document properties. Are you getting the appropriate data with ExifTool?

Subeesh A C


Permalink Reply by Baskar Selvaraj on February 17, 2016 at 4:45

Hi,

Using the command below, you can extract all metadata (i.e. every metadata tag associated with each PDF document) from hundreds of PDF documents and save it as a CSV file, which can then be used for batch import into DSpace.

If you require only specific tags, list the required metadata tags in the command. I have given an example below.

To extract all available metadata tags from the PDF documents and save it as a CSV file

---------------------------------------------------------------------------------------------------------------------

exiftool -csv *.pdf > output.csv

To extract specific metadata tags from the PDF documents and save it as a CSV file

-----------------------------------------------------------------------------------------------------------------------------

exiftool -csv -d %Y-%m-%d -Title -Author -Producer -Subject -Description -Type -Keywords -ISBN -CreateDate -CourseID -FileSize -PageCount -PDFVersion *.pdf > output.csv
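
As a rough sketch of the step between the CSV above and a DSpace batch import, the Python below renames ExifTool columns to Dublin Core style headers. The column mapping, the function name `exif_csv_to_dc`, and the sample rows are assumptions for illustration only. The exact columns your DSpace version expects for new items should be checked against its own documentation.

```python
import csv
import io

# Hypothetical mapping from ExifTool column names to Dublin Core style
# headers; adjust it to the fields your repository actually uses.
COLUMN_MAP = {
    "Title": "dc.title",
    "Author": "dc.contributor.author",
    "Subject": "dc.subject",
    "CreateDate": "dc.date.created",
}


def exif_csv_to_dc(exif_csv_text):
    """Return CSV text with mapped columns renamed and unmapped ones dropped."""
    reader = csv.DictReader(io.StringIO(exif_csv_text))
    out = io.StringIO()
    fieldnames = ["SourceFile"] + list(COLUMN_MAP.values())
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        new_row = {"SourceFile": row.get("SourceFile", "")}
        for exif_name, dc_name in COLUMN_MAP.items():
            # Columns absent from the input become empty cells, not errors.
            new_row[dc_name] = (row.get(exif_name) or "").strip()
        writer.writerow(new_row)
    return out.getvalue()


if __name__ == "__main__":
    # Made-up sample rows in the shape of "exiftool -csv" output.
    sample = (
        "SourceFile,Author,CreateDate,Title\n"
        "a.pdf,S. Kumar,2015-03-01,Annual Report\n"
        "b.pdf,,2016-01-10,\n"
    )
    print(exif_csv_to_dc(sample))
```

The function works on text rather than file paths so it can be tried on a few pasted rows before running it over a full output.csv.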

Hope this helps.

S. Baskar

LinuXpert Systems


Permalink Reply by Baskar Selvaraj on February 17, 2016 at 4:54

ExifTool Tag Names

The tables listed below give the names of all tags recognized by ExifTool.

http://www.sno.phy.queensu.ca/~phil/exiftool/TagNames/index.html


Permalink Reply by Subeesh A C on February 20, 2016 at 19:27

Thank you very much sir


Permalink Reply by Mujib Rahiman K U on February 17, 2016 at 18:34

A few years ago I created a small utility for extracting information from PDF files. It extracts data from all the files in a folder and saves it in a tab-delimited text file.

You can try it. I hope it helps. Please let me know.

I have uploaded the program to Google Drive. Click here to download

with regards

Mujib Rahiman

KV Kanjikode


Permalink Reply by Subeesh A C on February 20, 2016 at 19:28

Thanks sir, I will surely let you know.

Regards

Subeesh A C


Permalink Reply by Subeesh A C on February 22, 2016 at 23:38

Sir

I have checked your software; it is a great effort if you coded it yourself. As far as I can see, most such software cannot identify the metadata of our PDF files in the form we require. I think the problem mostly revolves around the structure of the PDF files themselves. In my case the PDF files have no standard structure (plus OCR), so the algorithm cannot extract metadata from them as it would from a well-formed file. Since we are in a hurry and need more metadata for the current work, we are thinking of indexing the files and filtering them later by various categories. Anyway, thanks for your reply.
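
One way to gauge how many files carry usable embedded metadata before settling on a workflow is to scan a CSV of extracted metadata for blank fields. The sketch below assumes a CSV with `SourceFile`, `Title` and `Author` columns, as ExifTool's CSV output would have when those tags are requested. The function name and the list of required fields are illustrative and not part of any tool mentioned in this thread.

```python
import csv
import io

REQUIRED = ("Title", "Author")  # assumed fields of interest


def files_missing_metadata(csv_text, required=REQUIRED):
    """Map each file name to the list of required fields that are blank."""
    missing = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        # A field counts as blank if its column is absent or holds only spaces.
        blanks = [f for f in required if not (row.get(f) or "").strip()]
        if blanks:
            missing[row.get("SourceFile", "")] = blanks
    return missing


if __name__ == "__main__":
    # Made-up sample rows for illustration.
    sample = (
        "SourceFile,Author,Title\n"
        "a.pdf,S. Kumar,Annual Report\n"
        "b.pdf,,\n"
        "c.pdf,R. Nair, \n"
    )
    for name, fields in files_missing_metadata(sample).items():
        print(name, "is missing:", ", ".join(fields))
```

Files reported here are the ones that would need manual cataloguing or a later indexing pass.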