Intro
I'd like to create some sort of archiving script, which would collect all Outlook (unicode) emails's date, sender (name+address), recipient(s) (name(s)+address(es)), subject and put them in a CSV file.
(The extra super solution would be if it could extract the containing folders' name and possible categories as well - although it is not a must.
And as final step, I would like to make it portable, so others could use it without having Python.)
(I'm using Python 2.7 and Outlook 2013)
Code
Here's what I have so far:
import win32com.client
import sys
import unicodecsv as csv
output_file = open('./outlook_farming_001.csv','wb')
output_writer = csv.writer(output_file, delimiter = ";", encoding='latin2')
outlook = win32com.client.Dispatch("Outlook.Application").GetNamespace("MAPI")
inbox = outlook.GetDefaultFolder(6) # "6" refers to the index of a folder - in this case,
# the inbox.
messages = inbox.Items
for i, message in enumerate(messages): # enumerated the items
try:
sender = message.SenderName
sender_address = message.sender.address
sent_to = message.To
date = message.LastModificationTime
subject = message.subject
output_writer.writerow([
date,
sender,
sender_address,
sent_to,
subject])
except Exception as e:
()
output_file.close()
The questions:
How to make sure it extracts all the email? (When I run the script, it works, but it extracted only 1555 emails, although my Outlook Inbox sais, it contains 4785.)
How to make it work on all the Outlook folders? (It only deals with Inbox, but I would need all the other folders (sent, and other created ones))
How to get the recipients' email address? (I can only extract the screened names)
If you have any tip for any of the questions, that would be greatly appreciated.
Thanks in advance!!
For question 2:
inbox = outlook.GetDefaultFolder(6)
"6" in the code refers to Inbox.
for folder in outlook.Folders:
print(folder.Name)
use the above for loop to look for all the folders in your mailbox.
For question 3:
To get sender email ID, you can use this piece of code:
messages = inbox.Items
message = messages.GetFirst()
sender_emailid =message.SenderEmailAddress
For question 1:
I dont have an answer. Sorry
Related
I am trying to download and save the outlook email attachment from the most recent email in a folder.
I have a code that downloads all of the attachment from a outlook folder and saves it.
Any help is appreciated.
from pathlib import Path
import win32com.client
output_dir = Path.home()/r"Documents\Test"
output_dir.mkdir(parents=True, exist_ok=True)
outlook = win32com.client.Dispatch("Outlook.Application").GetNamespace("MAPI")
inbox = outlook.GetDefaultFolder(6).folders("Sample Folder").folders("Sample Subfolder")
messages = inbox.Items
message = messages.GetFirst()
for message in messages:
if message.Subject == 'Sample Subject' or message.Subject == 'Sample Subject 2':
attachments = message.Attachments
subject = messages.GetFirst().Subject
for attachment in attachments:
attachment.SaveAsFile(output_dir / str(attachment))
I am trying to download and save the outlook email attachment from the most recent email in a folder.
To get the most recent item from the folder you need to sort the collection first by using the Sort method in the following way (VBA):
messages = inbox.Items
messages.Sort("[RecievedTime]", false)
message = messages.GetFirst()
Also iterating over all items in the folder is not really a good idea:
for message in messages:
if message.Subject == 'Sample Subject' or message.Subject == 'Sample Subject 2':
Instead, you need to use the Find/FindNext or Restrict methods of the Items class. They allows getting items that correspond to your conditions without iterating over all items in the folder. Read more about these methods in the articles that I wrote for the technical blog:
How To: Use Find and FindNext methods to retrieve Outlook mail items from a folder (C#, VB.NET)
How To: Use Restrict method to retrieve Outlook mail items from a folder
I hope you are well.
I have been doing some python practice with web scraping and have come across the win32com library and have been really struggling to get the attachment from my outlook email ( it is just the one)! I have managed to view all of the emails by the following code. I was wondering if you could help me with getting the attachment file called lets say "data_bay1.xlsx" as i am stumped on the errors. I can so far get the data from the email but i cannot get the attachment xlsx file and the attachment part of the code when i try to run it gives me errors. Please see what I have done so far and I hope any of you can help. I am using anaconda and windows. Thank you in advance! Kind regards, Lily
import win32com.client
#connecting python to outlook
outlook = win32com.client.Dispatch("Outlook.Application").GetNamespace("MAPI")
#connecting to our inbox
inbox = outlook.GetDefaultFolder(6)
(inbox)
# here we are ensuring the indexes exist so we can grab data
outlook=win32com.client.Dispatch("Outlook.Application").GetNamespace("MAPI")
for i in range(50):
try:
box = outlook.GetDefaultFolder(i)
name = box.Name
print(i, name)
except:
pass
messages = inbox.items
messages = messages.GetFirst() earlier
# get the last email
#message = messages.GetLast()
(message)
#to loop through the email in the inbox
while True:
try:
print(message.subject) # get the subject of the email
# if you use messages.GetFirst() earlier
message = messages.GetNext()
# if you use messages.GetPrevious() earlier
#message = messages.GetPrevious()
except:
# if you use messages.GetFirst() earlier
message = messages.GetNext()
# if you use messages.GetPrevious() earlier
#message = messages.GetPrevious()
# get the attachment
attachments = message.Attachments# return the first item in attachments
attachment = attachments.Item(1)
# the name of attachment file
attachment_name = str(attachment).lower()
attachment.SaveASFile(path+ 'C:\Users\lily\OneDrive - Dataenv\Documents + attachment_data_bay1)
Iterating over all items in the folder is not really a good idea:
#to loop through the email in the inbox
while True:
try:
print(message.subject) # get the subject of the email
# if you use messages.GetFirst() earlier
message = messages.GetNext()
# if you use messages.GetPrevious() earlier
#message = messages.GetPrevious()
except:
# if you use messages.GetFirst() earlier
message = messages.GetNext()
# if you use messages.GetPrevious() earlier
#message = messages.GetPrevious()
Instead, you need to use the Find/FindNext or Restrict methods of the Items to find items that correspond to your conditions. Read more about these methods in the following articles:
How To: Use Find and FindNext methods to retrieve Outlook mail items from a folder (C#, VB.NET)
How To: Use Restrict method to retrieve Outlook mail items from a folder
The best you can do is filter the list down to only the items with attachments. For that use PR_HASATTACH (DASL name is http://schemas.microsoft.com/mapi/proptag/0x0E1B000B) MAPI property and then loop through the returned items and process their Attachments collection.
I'm trying to automate getting attachments from certain emails and the documentation for win32com.client is horrendous.
So far I've got the following:
import win32com.client as win32
import os
outlook = win32.Dispatch("Outlook.Application").GetNamespace("MAPI")
inbox = outlook.Folders["Payments"].Folders["Inbox"]
messages = inbox.Items
for i in range(10):
message = messages.GetNext()
print(message.Sender)
print(message.Subject)
print(message.ReceivedTime)
attachment = message.attachments
for j in attachment:
j.SaveAsFile(os.getcwd() + "\\" + j.FileName)
However, I only want to get attachments from say "payments#email.com" which I can't figure out to do.
Is there a way to only get the emails and their attachments from certain senders (bonus if I can also filter for the email title)?
Use a restriction like filteredItems = Inbox.Items.Restrict("[SenderEmailAddress] = 'payments#email.com' ")
The documentation is at https://learn.microsoft.com/en-us/office/vba/api/outlook.items.restrict
I am trying to connect to outlook using python and read emails and write them to an output file along with all the corresponding attachments.
This is what I have so far:
import win32com.client
import unicodecsv as csv
import os
output_file = open('./outlook_farming_001.csv','wb')
output_writer = csv.writer(output_file, delimiter=';', encoding='latin2')
outlook = win32com.client.Dispatch("Outlook.Application").GetNamespace("MAPI")
inbox = outlook.GetDefaultFolder(6).Folders.Item("Security Availabilities")
messages = inbox.Items
for i, message in enumerate(messages):
try:
sender = message.SenderName
sender_address = message.SenderEmailAddress
sent_to = message.To
date = message.LastModificationTime
subject = message.subject
body = message.body
attachments = message.Attachments
attachment = attachments.Item(1)
for attachment in message.Attachments:
attachment.SaveAsFile(os.path.join(output_file, str(attachment)))
output_writer.writerow([
sender,
sender_address,
subject,
body,
attachment])
except Exception as e:
()
output_file.close()
Without the attachment stuff in the code- it works fine. I am able to read all the emails from my specific subfolder.
However, I am unable to read, save and display attachments along with their corresponding emails.
I think your mistake is in using str(attachment) in the filename - this will cause trouble because it should give some sort of '<COMObject <unknown>>' string.
Instead, use the following:
for attachment in message.Attachments:
attachment.SaveAsFile(os.path.join(output_file, attachment.FileName))
I hope this helps!
I am trying to use python to go through outlook and get all emails by a sender. I have looked but can't find out how to do this. I can get an email by subject and return the sender, but I am looking to get all senders and then return the subject? This is what I am using to get sender by subject.
import win32com.client
outlook = win32com.client.Dispatch("Outlook.Application").GetNamespace("MAPI")
inbox = outlook.GetDefaultFolder(6) # "6" refers to the index of a folder - in this case,
# the inbox. You can change that number to reference
# any other folder
messages = inbox.Items
message = messages("Test 08/18/14")
print(message.sender)
This returns the sender for the mail with the subject "Test 08/19/14"
I would like to go through my email and get all email subjects from a certain sender.
It looks like you're looking for the SenderEmailAddress property.
You could go through your messages for a particular sender via:
for m in messages:
if m.SenderEmailAddress == 'some_sender#somewhere.com':
print(m)