
This project focuses on automating extraction of research paper metadata such as title, author, keywords, and publication details from uploaded documents and updating institutional repositories, reducing manual cataloging efforts.
Study academic repository management systems.
Identify repetitive metadata entry tasks.
Design automation workflow.
Develop RPA bot to extract metadata from PDFs.
Structure extracted data into database format.
Validate author and publication details.
Update repository records automatically.
Generate cataloging summary reports.
Implement duplicate detection logic.
Test with multiple research papers.
Evaluate metadata extraction accuracy.
Document project results and challenges.