Python Pandas combine [duplicate] - python

This question already has answers here:
How do I Pandas group-by to get sum?
(11 answers)
Closed last month.
Let's say I have the Python pandas dataframe below, I want to combine the amount paid column based on employee ID.
[dataframe]

Well try:
dataframe.groupby('EMPLOYEE_ID')['AMOUNT_PAID'].sum()

Related

Pandas column separation using .loc [duplicate]

This question already has answers here:
Selecting multiple columns in a Pandas dataframe
(22 answers)
Closed last month.
I am an amateur user.
I watched many videos but I couldn't figure out this error.
How can I keep PERSON_WGHT, LOS, and IDC_DC_CD_1 as a columns for all rows that is 386816.
If you need to select multip0le columns from all the records then use df[[column_list]].
df_new = df[['PERSON_WGHT', 'LOS', 'IDC_DC_CD_1']]

Final value in dataframe showing as not inside dataframe when going through for loop [duplicate]

This question already has answers here:
Python Pandas -- why does the `in` operator work with indices and not with the data?
(1 answer)
How to determine whether a Pandas Column contains a particular value
(11 answers)
Closed 1 year ago.
I'm checking to see if all the values in one column of one dataframe lies in the column of another dataframe. When I run the code below, it says that 4 does not exist in df1. Is there any particular reason for this?
list1=[1,2,3,4]
list2=[1,2,3,4]
df2=pd.DataFrame(list2)
df2.rename(columns={0:"List2"},inplace=True)
df1=pd.DataFrame(list1)
df1.rename(columns={0:"List1"},inplace=True)
for i in df2['List2']:
if i not in df1['List1']:
print(i)

How to query from multiindex dataframe containing data and time as index [duplicate]

This question already has answers here:
selecting from multi-index pandas
(7 answers)
Closed 1 year ago.
Check this image of dataframe
I've posted the picture of dataframe I am working with, I want to pull out data from specific times of a certain date
I've tried
stockdf.loc[("2015-01-01")].loc['09:17:00']
stockdf.loc[("2015-01-01","09:17:00"),:]
Both do not work.
Just try:
stockdf.loc[("2015-01-01", "09:17:00")]
If they're dates:
stockdf.loc[(pd.to_datetime("2015-01-01").date(), pd.to_datetime("09:17:00").time())]

How to convert SQL select statement with particular fields and where clause to pandas dataframe [duplicate]

This question already has answers here:
How do I sum values in a column that match a given condition using pandas?
(3 answers)
Pandas counting and summing specific conditions
(4 answers)
Closed 5 years ago.
I am trying to convert fallowing SQL statement into pandas dataframe in python
SELECT sum(money) from df where sex='female'
I am unable to get this in pandas
Thanks in advance
df.loc[df.sex=='female','money'].sum()

Python Pandas: How can I search for a string in a column? [duplicate]

This question already has answers here:
Filter pandas DataFrame by substring criteria
(17 answers)
Closed 7 years ago.
I have a DataFrame like this:
col1,col2
Sam,NL
Man,NL-USA
ho,CA-CN
And I would like to select the rows whose second column contains the word 'NL', which is something like SQL like command. Does anybody know about a similar command in Python Pandas?
The answer is here. Let df be your DataFrame.
df[df['col2'].str.contains('NL')]
This will select all the records that contain 'NL'.

Categories

Resources