I've been tasked with creating a search system that will help users to navigate through multiple 1000+ page pdf files. However, these files will first have to be put on a MySQL DB. The issue that i'm currently having is how do I store these PDF files on the DB and assign the relevant PDF headers to the DB.
Example:
Adding each Part/Header/Section/Subsection individually on the DB in different tables.
Would this all have to be manually entered? Bare in mind we are talking 100,000s pages + of PDF.
Thanks
You would be better to store some meta data in the database, and store the location of the PDF file.
i.e. a table called 'documents' may have the following fields:
id,path,keywords,category
The path would be: /some/location/to/my/pdf/file.pdf
The keywords could be; 'pdf1, what is a pdf, some search text'
This will allow you to store the pdf files.
Alternatively you could use something like Google - they allow you to use their search technology. It used to be in the form of a 'google yellow box' but I believe it's now part of their cloud stuff!
HTH
Related
I'm using Laravel7 with Aws-S3 as file storage.
The files are PDF only and and I want to add search files feature i.e. If a user search for a text, I want to list all PDF files that has the matching text.
Is this something possible using Laravel alone or using Aws/S3?
I know I can extract all the text of a file on upload and store it in database, and when user search for a text, I can search it from database using %LIKE% query but this will take a huge database space.
I'm looking for something better.
I have 70gb PDF files, and I want to search inside them with PHP and some Ajax.
The code must search on all PDF files and extract the data out into table,
For example: 1547AD
When I hit enter the code will search in all PDF files and extract all PDF files that contain "1547AD" inside them.
My problem is: of course putting these data inside MySQL will be better for the server and stronger but imagine extracting all tables in 70GB of PDF files! and these pdf files updated daily, also there is alot of traffic on this page.
My question is: Is it the right way to build this in PHP or I should use another language and/or another method for this kind of heavy data?
I am currently working with a new website for my company and there is some functionality that I have some issue fixing.
We want a page where you will find a list of filenames (like different types of manuals etc.) and basically a checkbox where you decide what files you want to merge together to one new pdf.
So, I am a web designer, not great at developing but I can not find any guidelines or help to achieve this.
How do I link my pdf's to the database and how do I create a new pdf (like in acrobat) from the ones I already have? If someone have any good edwice, I would be really glad.
Suggestion:
Save the path to your pdf's on the database. After that you'll need to query the pdf file path according to the files the user wants to merge (checkboxes), loop through the files and merge them using the for example pdftk tool, executing a command like this:
passthru('pdftk file1.pdf file2.pdf file3.pdf cat output mergedFile.pdf');
Then output mergedFile.pdf.
Also take a look at: Merge PDF files with PHP
Does anyone know of a image tagging solution for websites that reads and writes the tag data to a database instead of a locally stored file? I've looked at http://jquery-notes.rydygel.de/index.php but the current version doesn't have database support. I just need a solution that can be used on all the mainstream browsers. The reason I don't want to read and write from a local file is because there might be a lot of images, and I don't want to store everything locally. Thanks!
You will be need to store the notes to a database table with following rows
ID
IMAGE_ID
LEFT
TOP
WIDTH
HEIGHT
DATE
NOTE
AUTHOR
LINK
And i believe the plugin has a php class for retrieving the notes. The file is notes.class.php
What is the best way to store documents that saves space? By letting user documents to be stored in phpmyadmin db or by creating folders and letting user upload documents there?
I want to allow scans like pdf and images and also word docs and .txt files.
Also, how will i go about uploading documents? Any tutorials i can use? New to this.
Thanks!
phpmyadmin, is an script that lets you edit\add\etc mysql db's its not a db. Depending on the type of documents, and what you plan to do with them I would always say store them in the file system, you can store there name\path in the db.
file upload tutorial: Handling file uploads