Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 6 days ago.
I want to generate a large volume of PDFs: around 60,000 now, potentially growing to around 500,000.
To generate the PDFs I use the python-docx library ("from docx import Document"), and each PDF is about 1-2 pages long.
I want to save these files on a server, and the whole process shouldn't take more than 8 hours.
Should I go for a centralized (single-process) approach? Or should I go for a multithreading or multiprocessing approach? Or something else?
Also, one of the requirements is to use as few libraries as possible, and the Python version is 3.6.8.
Please give me your opinion on the subject
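One way to frame the choice is to work out the throughput the 8-hour deadline implies. The sketch below is only an illustration using the standard library, which fits the "few libraries" and Python 3.6 constraints. It computes the required documents per second and a rough worker count for multiprocessing.Pool. The 0.5 seconds per document is an assumed figure, not a measurement, and render_one is a hypothetical placeholder for the real generation code.

```python
import math
import multiprocessing


def required_rate(total_docs, hours):
    """Documents per second needed to finish total_docs within hours."""
    return total_docs / (hours * 3600.0)


def workers_needed(total_docs, hours, seconds_per_doc):
    """Rough number of parallel workers.

    Assumes each worker needs seconds_per_doc per document and that
    throughput scales linearly with workers (an optimistic assumption).
    """
    return int(math.ceil(required_rate(total_docs, hours) * seconds_per_doc))


def render_one(job_id):
    """Hypothetical placeholder: the real document generation goes here."""
    return job_id


def generate_all(job_ids, workers):
    """Spread the jobs over a pool of worker processes; return the count done."""
    with multiprocessing.Pool(processes=workers) as pool:
        # imap_unordered yields results as workers finish; a larger
        # chunksize reduces inter-process overhead for many small jobs.
        results = pool.imap_unordered(render_one, job_ids, chunksize=100)
        return sum(1 for _ in results)


if __name__ == "__main__":
    print(required_rate(500000, 8))        # documents per second
    print(workers_needed(500000, 8, 0.5))  # 0.5 s/doc is an assumed cost
```

At 500,000 documents in 8 hours the budget is roughly 17 documents per second, or about 58 ms per document on a single process. Timing a small batch first would show whether one process is already fast enough or whether a pool is needed.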
Closed 5 years ago.
I have hundreds of articles on many topics as PDF files in a directory. I need to find the articles among them that contain the keywords "git log" or "git diff", and then collect the selected articles in a list.
How can we do that using Python?
If you are not opposed to using a library, you can use
https://github.com/euske/pdfminer
I've done something of the sort for Node.js: recursively scan the directory, run every file through pdfminer, and collect the results.
Good luck!
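A minimal sketch of that recursive scan in Python follows. It assumes the maintained pdfminer.six fork is installed, which provides pdfminer.high_level.extract_text; the linked repository may not offer that helper. The names find_matching_pdfs and extract_pdf_text are made up for this illustration.

```python
import os

KEYWORDS = ("git log", "git diff")


def extract_pdf_text(path):
    """Return the text of one PDF (assumes pdfminer.six is installed)."""
    from pdfminer.high_level import extract_text
    return extract_text(path)


def find_matching_pdfs(root, keywords=KEYWORDS, extract=extract_pdf_text):
    """Walk root recursively and list PDFs whose text contains any keyword."""
    matches = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            if not name.lower().endswith(".pdf"):
                continue
            path = os.path.join(dirpath, name)
            # Lower-case and collapse whitespace so a keyword split
            # across a line break in the PDF still matches.
            text = " ".join(extract(path).lower().split())
            if any(keyword in text for keyword in keywords):
                matches.append(path)
    return matches
```

Calling find_matching_pdfs("articles") would return the list of matching paths, where "articles" stands in for the real directory. Scanned PDFs without a text layer would not match, since there is no text to extract.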
Closed 6 years ago.
I want to develop a script in Python that can identify video clips contained in a video stream.
Could you please suggest some Python libraries or example scripts to get started with comparing video files?
Thanks
This may be of use. It covers images rather than video, but the same principles apply:
https://realpython.com/blog/python/fingerprinting-images-for-near-duplicate-detection/
More generally, the technology you are most likely interested in is usually referred to as video fingerprinting. This is a fairly dynamic domain, so it's worth doing some searching to get a feel for the current state of the art.
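To make the fingerprinting idea concrete, here is a small sketch of one common image fingerprint, a difference hash, written in plain Python. It assumes each frame has already been decoded, converted to grayscale, and shrunk to a small grid (for example 8 rows of 9 values); decoding the video is left out. The function names are illustrative only.

```python
def dhash(gray_rows):
    """Difference hash: one bit per adjacent pixel pair in each row.

    gray_rows is a list of rows of grayscale values. A bit is 1 when
    the left pixel is brighter than its right neighbour.
    """
    bits = 0
    for row in gray_rows:
        for left, right in zip(row, row[1:]):
            bits = (bits << 1) | (1 if left > right else 0)
    return bits


def hamming(a, b):
    """Number of differing bits between two fingerprints."""
    return bin(a ^ b).count("1")


def looks_similar(frame_a, frame_b, max_distance=10):
    """Treat frames as near-duplicates when few bits differ.

    max_distance=10 is an assumed threshold that needs tuning on real data.
    """
    return hamming(dhash(frame_a), dhash(frame_b)) <= max_distance
```

Because the hash records only whether brightness rises or falls between neighbours, a uniformly brightened copy of a frame produces the same fingerprint. For video, the same comparison could be run on sampled frames of the clip against sampled frames of the stream.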
Closed 6 years ago.
I recently started working in a lab that generates a lot of data, and I need to manipulate files very often for various purposes. I have learned awk, but it does not seem to be enough for my work. I know Python, but not well enough to work comfortably with files. Could anyone suggest a book or online tutorial that focuses specifically on using Python with files? Most Python books do not cover this subject in depth.
Thanks
You may find these materials helpful:
http://www.diveintopython.net/file_handling/file_objects.html
http://opentechschool.github.io/python-data-intro/core/text-files.html
For data manipulation, you may also need to improve your skills in string processing, regular expressions, and data structures.
You can attend this course:
https://www.edx.org/course/introduction-python-data-science-microsoft-dat208x-0
I would recommend familiarizing yourself with the os and sys standard libraries. Here is a comprehensive tutorial that covers both libraries and other necessary aspects of file management in Python.
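As a small taste of what this looks like in practice, here is a sketch of an awk-style task using only the standard library: summing one column in every tab-separated file in a directory. The ".tsv" suffix, the column index, and the function names are assumptions made for the example.

```python
import csv
import os


def sum_column(path, column, delimiter="\t"):
    """Sum the numeric values found in one column of a delimited file."""
    total = 0.0
    with open(path, newline="") as handle:
        for row in csv.reader(handle, delimiter=delimiter):
            try:
                total += float(row[column])
            except (IndexError, ValueError):
                # Skip headers, blank lines, and non-numeric cells.
                continue
    return total


def sum_column_in_dir(directory, column, suffix=".tsv"):
    """Map each matching file name in directory to its column total."""
    totals = {}
    for name in sorted(os.listdir(directory)):
        if name.endswith(suffix):
            totals[name] = sum_column(os.path.join(directory, name), column)
    return totals
```

This is roughly what awk -F'\t' '{s += $2} END {print s}' does for a single file, with the directory loop added on top.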
Closed 7 years ago.
I am teaching art students the basics of Python and it would be very convenient to add some graphics (drawings, pictures, etc.) in the comments within the code instead of "just" code and text comments.
I have seen some friends using Mathematica and exchanging beautiful files that include graphics, comments, pictures, etc.
Does such a thing exist for Python?
Have you looked at the IPython Notebook? It lets you write and run code and use HTML for notes. If you know HTML, it will be easy to add graphics too. I'm not sure it exactly solves your problem, but it is definitely a nice tool.
Link: http://ipython.org/notebook.html
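To show how notes with graphics and code sit side by side, here is a sketch that writes a tiny notebook file using only the json module. It assumes the version 4 notebook layout (a JSON document with a list of cells); the file names and the build_notebook helper are invented for the example, and in practice you would normally create cells in the notebook interface instead.

```python
import json


def build_notebook(note_html, code):
    """Return a minimal notebook: one HTML/Markdown note cell, one code cell."""
    return {
        "nbformat": 4,
        "nbformat_minor": 2,
        "metadata": {},
        "cells": [
            # Markdown cells accept inline HTML, including <img> tags.
            {"cell_type": "markdown", "metadata": {}, "source": [note_html]},
            {
                "cell_type": "code",
                "execution_count": None,
                "metadata": {},
                "outputs": [],
                "source": [code],
            },
        ],
    }


def write_notebook(path, notebook):
    """Save the notebook dictionary as a .ipynb file."""
    with open(path, "w") as handle:
        json.dump(notebook, handle, indent=1)


if __name__ == "__main__":
    lesson = build_notebook(
        '<h3>Drawing a square</h3><img src="square.png" alt="a square">',
        'for side in range(4):\n    print("forward, turn right")',
    )
    write_notebook("lesson.ipynb", lesson)
```

Opening lesson.ipynb would show the heading and the picture above the runnable code, provided square.png sits next to the notebook.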
Closed 4 years ago.
I'm trying to write a karaoke program in Python. Every karaoke program has basic functionality such as seeking in the video and shifting the pitch of the music by half steps. What modules can I use to provide this functionality?
I'm going to use wxPython for the GUI portion, if that makes a difference!
Honestly, you might want to take a look at PyGame. It has fairly robust libraries for handling things like music and movies: http://www.pygame.org/docs/
wxPython has a built-in media controller, wx.MediaCtrl, which can play both audio and video. It has most of the basic functionality built in, such as seek and pause. I've found it very easy and reliable.
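On the pitch side of the question, shifting by half steps comes down to a frequency ratio. The sketch below assumes equal temperament, where each half step multiplies the frequency by the twelfth root of 2. It only does the arithmetic; applying the shift to audio needs a resampling or pitch-shifting routine that is not shown here. The function names are illustrative.

```python
def semitone_ratio(half_steps):
    """Frequency ratio for a shift of half_steps in equal temperament."""
    return 2.0 ** (half_steps / 12.0)


def shifted_frequency(frequency_hz, half_steps):
    """Frequency after moving up (positive) or down (negative) by half steps."""
    return frequency_hz * semitone_ratio(half_steps)


if __name__ == "__main__":
    # A4 (440 Hz) raised by two half steps, a typical key change.
    print(shifted_frequency(440.0, 2))
```

Twelve half steps double the frequency (one octave), and a negative count lowers the key by the same ratio.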