How to parse .doc document without WINWORD? [closed] - python

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 2 years ago.
Is there a way to parse or replace text in a .doc document without WINWORD, using Python, Go, C++, or another language? I checked the structure of a .doc document and it looks like this:
[1]CompObj
[5]DocumentSummaryInformation
[5]SummaryInformation
1Table
WordDocument
But most of it is binary data.

In Go, you could use unidoc/unioffice, a pure Go library for creating and processing Office Word (.docx), Excel (.xlsx), and PowerPoint (.pptx) documents.
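The stream names listed in the question (CompObj, WordDocument, 1Table, and so on) are entries in an OLE2 compound file, the container format used by binary .doc files. As a minimal sketch, the Python below checks the container's 8-byte signature and reads a few fixed header fields using only the standard library. The function name `read_ole_header` and the returned keys are hypothetical choices for illustration. It does not walk the directory sectors, so it does not list or decode the streams themselves.

```python
import struct
from typing import Optional

# Signature found at the start of every OLE2 compound file.
OLE_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")


def read_ole_header(data: bytes) -> Optional[dict]:
    """Return basic header fields if data starts like an OLE2 file, else None.

    Hypothetical helper for illustration; it only inspects the fixed header.
    """
    if len(data) < 34 or data[:8] != OLE_MAGIC:
        return None
    # Offsets 24..33: minor version, major version, byte-order mark,
    # sector shift, mini-sector shift (all little-endian uint16).
    minor, major, byte_order, sector_shift, mini_shift = struct.unpack_from(
        "<HHHHH", data, 24
    )
    return {
        "minor_version": minor,
        "major_version": major,
        "byte_order_mark": byte_order,
        "sector_size": 1 << sector_shift,
        "mini_sector_size": 1 << mini_shift,
    }
```

To try it on a real file, read the first 512 bytes with `open(path, "rb").read(512)` and pass them in. A `None` result means the file is not an OLE2 container (a .docx, for example, is a ZIP archive instead).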

Related

Encrypting csv and Json files in Python [closed]

Closed 1 year ago.
I have Python code that reads and writes JSON and CSV files. I want to secure the information contained in the files. Is there a way to encrypt the JSON and CSV files so that the data stored in them is readable only by my Python code?

What can I use to convert speech to text (offline)? [closed]

Closed 2 years ago.
I want to know which Python libraries can convert any speech to text locally (offline), transcribing the words literally even when they are incomprehensible or meaningless.
Have a look at SpeechRecognition. CMU Sphinx will work offline but may not be as efficient.

How to find PDF files that contain some keyword using Python [closed]

Closed 5 years ago.
I have hundreds of articles on many topics stored as PDF files in a directory. I need to pick out the papers that contain the keywords git log or git diff, and then collect the selected articles in a list.
How can I do that using Python?
If you are not opposed to using a library, you can use
https://github.com/euske/pdfminer
I've done something similar in Node.js: recursively walk the directory, scan every file with pdfminer, and collect the results.
Good luck!
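The approach described above (walk the directory, extract text from each PDF, keep the files that mention a keyword) can be sketched as follows. This is only an illustration: `find_files_with_keywords` is a hypothetical name, and the text extractor is passed in as a parameter rather than hard-coded. Assuming the pdfminer.six package is installed, `pdfminer.high_level.extract_text` could be supplied as that extractor. Whitespace is collapsed before matching because extracted PDF text often breaks a phrase such as "git log" across lines.

```python
import os
from typing import Callable, Iterable, List


def find_files_with_keywords(
    directory: str,
    keywords: Iterable[str],
    extract_text: Callable[[str], str],
) -> List[str]:
    """Return paths of .pdf files under directory whose text has any keyword.

    extract_text is any callable mapping a file path to its text content.
    """
    # Normalize keywords the same way as the document text below.
    wanted = [" ".join(k.split()).lower() for k in keywords]
    matches = []
    for root, _dirs, files in os.walk(directory):
        for name in sorted(files):
            if not name.lower().endswith(".pdf"):
                continue
            path = os.path.join(root, name)
            try:
                text = extract_text(path)
            except Exception:
                # Skip files the extractor cannot read (damaged, encrypted).
                continue
            # Collapse runs of whitespace and newlines into single spaces.
            normalized = " ".join(text.split()).lower()
            if any(k in normalized for k in wanted):
                matches.append(path)
    return matches
```

A call might look like `find_files_with_keywords("articles", ["git log", "git diff"], extract_text)`, where `"articles"` is a placeholder directory name.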

How to copy multiple text files into Excel sheets with Python [closed]

Closed 5 years ago.
I have 20 text files. I have to open them as sheets in Excel (one Excel file, with a separate sheet for each text file, named after that file). Can anyone please suggest the logic and a good package for this? I am new to Python.
Try openpyxl. You should work out the logic yourself; come back when you have more specific questions.
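One possible shape for that logic, as a rough sketch rather than a definitive solution: read each text file, derive a sheet title from the file name, and write one row per line. The helper names `sheet_title`, `load_sheets`, and `write_workbook` are hypothetical. The sketch assumes UTF-8 text files and puts each line into a single cell in column A. Titles are cut to 31 characters, which is Excel's limit for sheet names. The openpyxl import sits inside `write_workbook`, so the reading step works without that package installed.

```python
from pathlib import Path
from typing import Dict, Iterable, List

MAX_SHEET_TITLE = 31  # Excel does not accept longer sheet names.


def sheet_title(path: str) -> str:
    """Derive a sheet title from a file name (without its extension)."""
    return Path(path).stem[:MAX_SHEET_TITLE]


def load_sheets(paths: Iterable[str]) -> Dict[str, List[List[str]]]:
    """Map each sheet title to its rows, one single-cell row per text line."""
    sheets = {}
    for path in paths:
        lines = Path(path).read_text(encoding="utf-8").splitlines()
        sheets[sheet_title(path)] = [[line] for line in lines]
    return sheets


def write_workbook(sheets: Dict[str, List[List[str]]], out_path: str) -> None:
    """Write the sheets to one .xlsx file; needs at least one sheet."""
    from openpyxl import Workbook  # third-party, imported only when writing

    wb = Workbook()
    wb.remove(wb.active)  # drop the default empty sheet
    for title, rows in sheets.items():
        ws = wb.create_sheet(title=title)
        for row in rows:
            ws.append(row)
    wb.save(out_path)
```

With the 20 files in a folder, usage could be `write_workbook(load_sheets(sorted(str(p) for p in Path("texts").glob("*.txt"))), "combined.xlsx")`, where `texts` and `combined.xlsx` are placeholder names.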

Python PDF Libraries [closed]

Closed 8 years ago.
I am looking for a good Python PDF library that would convert some format to PDF (I want to use it to create documents programmatically). I have done some searching, and PyPDF looks like an option. Are there any other libraries that I should be aware of?
Here are a few options: ReportLab, PyX, and pyfpdf.
