How to remove letters and space from line in python?

How to remove letters and space from line in python? - python

Im new in python and I have problem like this:
I have a list
Index = ['TH', '1', '1', '2', '28', '29', '2', '']
and I wat to get
max(Index)
I have something like this sofar
import numbers
Index = ['TH', '1', '1', '2', '28', '29', '2', '']
Index1 = [x for x in Index if x] #to remove empty space
Index2 = [x for x in Index1 if isinstance(x, numbers.Number)] #to remove all letters
Index3 = map(int, Index2)
print (max(Index3))
But output from Index1 and Index2 is always [].

Perhaps you meant to use the isdigit() method of strings:
Index = ['TH', '1', '1', '2', '28', '29', '2', '']
Index1 = [x for x in Index if x] #to remove empty space
Index2 = [x for x in Index1 if x.isdigit()] #to remove all letters
Index3 = map(int, Index2)
print (max(Index3))
Output:
29

Index = ['TH', '1', '1', '2', '28', '29', '2', '']
my_index = [int(elem) for elem in Index if elem.isnumeric()]

Index = ['TH', '1', '1', '2', '28', '29', '2', '']
print(max(map(int,filter(str.isdigit,Index))))
Filter out anything not a digit,
filter(str.isdigit,Index)
then map all to integers,
map(int,)
Then get the max.
max()

Index = ['TH', '1', '1', '2', '28', '29', '2', '']
result = [i for i in Index if not(not i or i.isalpha())]
print(result)
Output:- ['1', '1', '2', '28', '29', '2']

Related

Python: How to convert a set of strings that contains commas represented as an element of a list to a sub-list?

I have a list that contains a set of strings like this:
list = ['235,ACCESS,19841136,22564960,4291500,20,527434,566876','046,ALLOWED,24737321,27863065,1086500,3,14208500,14254500']
I'm trying to make the elements of the list a sublist but without splitting the string.
I tried new_list = list(map(list, list)). This is the result taking as reference the first element of the list:
print(new_list[0]):
[['2', '3', '5', ',', 'A', 'C', 'C', 'E', 'S',',','1', '9', '8', '4', '1', '1', '3', '6', ',', '2', '2', '5', '6', '4', '9', '6', '0', ',', '4', '2', '9', '1', '5', '0', '0', ',', '2', '0', ',', '5', '2', '7', '4', '3', '4', ',', '5', '6', '6', '8', '7', '6']]
I would like this output:
print(new_list[0]):
[[235,'ACCESS',19841136,22564960,4291500,20,527434,566876]]
Thanks in advance for your help!

You can try split() with delimiter , like this -
new_list = [i.split(',') for i in list]
print (new_list[0])
Output:
['235', 'ACCESS', '19841136', '22564960', '4291500', '20', '527434', '566876']
One thing is that here the numbers are also represented as string. If you want integers instead you can use isdigit() method like this -
new_list = [[int(e) if e.isdigit() else e for e in i.split(',') ]for i in list]
print(new_list[0])
Output:
[235, 'ACCESS', 19841136, 22564960, 4291500, 20, 527434, 566876]
Also, please try to avoid naming your list list

Split list on smaller lists with equal elements

In my list "A" i have got numbers and ' ', so I want to make a list of list named e.g "b", every list should have nine number (if it possible), no matter how much it have ' '.
Any idea how to do this?
A = ['1', '3', '4', '5', '7', '8', '9', ' ', '13', '16', '3', ' ', '5', '17']
B = [ ['1', '3, '4', '5', '7', '8', '9', ' ', '13', '16'], ['3', ' ', '5', '17'] ]

This will help you:
>>> a = ['1', '3', '4', '5', '7', '8', '9', ' ', '13', '16', '3', ' ', '5', '17']
>>> b=[a[i:i+9] for i in xrange(0,len(a),9)]
>>> b
[['1', '3', '4', '5', '7', '8', '9', ' ', '13'], ['16', '3', ' ', '5', '17']]
>>>

This can be done with two nested while loops:
>>> A = ['1', '3', '4', '5', '7', '8', '9', ' ', '13', '16', '3', ' ', '5', '17']
>>> B = []
>>> while A:
... L = []
... c = 0
... while A and c < 9:
... L.append(A.pop(0))
... if L[-1].isdigit():
... c += 1
... B.append(L)
...
>>> B
[['1', '3', '4', '5', '7', '8', '9', ' ', '13', '16'], ['3', ' ', '5', '17']]
The outer one loops while A is not empty and the inner one while A is not empty and the number of digit only strings appended to the current sub-list is less than 9. The counter is only incremented after a string consisting of only digits is found.

It would be worth your time to get deep into list comprehensions
And there is no xrange in Python 3.x or rather range (in 3.x) does exactly what xrange did in Python 2.x.
A = ['1', '3', '4', '5', '7', '8', '9', ' ', '13', '16', '3', ' ', '5', '17']
B = [i for i in A[0:9]] #is cleaner.
Though I'm not sure exactly what your goal is. Do you want the second list (the remainder list as I'm thinking of it) to be in the same variable? So if you had 28 elements in your list you'd want three lists of 9 and one list of 1?

This is a bit dirty solution but I think you might need to check isdigit part and pop.
def take(lst, n):
if not lst:
return ValueError("Empty list, please check the list.")
items = list(lst)
new_list = []
count = 0
while items:
item = items.pop(0)
new_list.append(item)
if item.isdigit():
count += 1
if count >= n:
yield new_list
new_list = []
count = 0
if new_list:
yield new_list
A = ['1', '3', '4', '5', '7', '8', '9', ' ', '13', '16', '3', ' ', '5', '17']
B = [ii for ii in take(A, 9)]
#[['1', '3', '4', '5', '7', '8', '9', ' ', '13', '16'], ['3', ' ', '5', '17']]
Check the following:
https://docs.python.org/2/library/stdtypes.html#str.isdigit

python all combinations of subsets of a string

I need all combinations of subsets of a string. In addition, a subset of length 1 can only be followed by a subset with length > 1. E.g. for string 4824 the result should be:
[ [4, 824], [4, 82, 4], [48, 24], [482, 4], [4824] ]
So far I managed to retrieve all possible subsets with:
length = len(number)
ss = []
for i in xrange(length):
for j in xrange(i,length):
ss.append(number[i:j + 1])
which gives me:
['4', '48', '482', '4824', '8', '82', '824', '2', '24', '4']
But I don't know how to combine those now.

First, write a function for generating all the partitions of the string:
def partitions(s):
if s:
for i in range(1, len(s) + 1):
for p in partitions(s[i:]):
yield [s[:i]] + p
else:
yield []
This iterates all the possible first segments (one character, two characters, etc.) and combines those with all the partitions for the respective remainder of the string.
>>> list(partitions("4824"))
[['4', '8', '2', '4'], ['4', '8', '24'], ['4', '82', '4'], ['4', '824'], ['48', '2', '4'], ['48', '24'], ['482', '4'], ['4824']]
Now, you can just filter those that match your condition, i.e. those that have no two consecutive substrings of length one.
>>> [p for p in partitions("4824") if not any(len(x) == len(y) == 1 for x, y in zip(p, p[1:]))]
[['4', '82', '4'], ['4', '824'], ['48', '24'], ['482', '4'], ['4824']]
Here, zip(p, p[1:]) is a common recipe for iterating over all pairs of consecutive items.
Update: Actually, incorporating your constraint directly into the partition function is not that hard, either. Just keep track of the last segment and set the minimum length accordingly.
def partitions(s, minLength=1):
if len(s) >= minLength:
for i in range(minLength, len(s) + 1):
for p in partitions(s[i:], 1 if i > 1 else 2):
yield [s[:i]] + p
elif not s:
yield []
Demo:
>>> print list(partitions("4824"))
[['4', '82', '4'], ['4', '824'], ['48', '24'], ['482', '4'], ['4824']]

would be interesting to see more test cases, the following algorithm does what you say:
s="4824"
def partitions(s):
yield [s]
if(len(s)>2):
for i in range(len(s)-1, 0, -1):
for g in partitions(s[i:]):
out = [s[:i]] + g
if not any([len(out[i]) == len(out[i+1]) and len(out[i])==1 for i in range(len(out)-1)]):
yield out
list(partitions(s))
you get:
[['4824'], ['482', '4'], ['48', '24'], ['4', '824'], ['4', '82', '4']]
explanation
I based on the following algorithm:
s="4824"
def partitions_original(s):
#yield original string
yield [s]
if(len(s)>2):
for i in range(len(s)-1, 0, -1):
#divide string in two parts
#iteration 1: a="482", b="4"
#iteration 2: a="48", b="24"
#iteration 3: a="4", b="824"
a = s[:i]
b = s[i:]
#recursive call of b
for g in partitions_original(b):
#iteration 1: b="4", g=[['4']]
#iteration 2: b="24", g=[['24']]
#iteration 3: b="824", g=[['824'], ['82', '4'], ['8', '24']]
yield [a] + g
list(partitions_original(s))
you get:
[['4824'], ['482', '4'], ['48', '24'], ['4', '824'],
['4', '82', '4'], ['4', '8', '24']]
the problem is ['4', '8', '24'] ..... then I must add if to code, because "a subset of length 1 can only be followed by a subset with length > 1"
[len(out[i]) == len(out[i+1]) and len(out[i])==1 for i in range(len(out)-1)] return for ['4', '8', '24'] -> [True, False] .... any Return True if any element of the iterable is true
NOTE
also it can be used:
if all([len(out[i]) != len(out[i+1]) or len(out[i])!=1 for i in range(len(out)-1)]):

What I am doing here is to get all the possible split position of the string and eliminate the last one.
for example, in some string with 5 numbers "12345" for ex., there is 4 possible position to split the string, call it possibility = (0,0,0,0),(1,0,1,0)... with (0,0,1,0) mean (don't separate 1 and 2345,don't separate 12 and 345,separate 123 and 45,don't separate 1234 and 5) so you can get all possibilities while your condition is verified since we eliminate the (1,1,1,1) case.
import itertools
from math import factorial
from itertools import product
def get_comb(string):
L = len(string_)
combinisation = []
for possibility in product([0,1], repeat=len(string_)-1):
s = []
indexes = [i for i in range(len(string_)-1) if list(possibility)[i]!=0]
if sum(indexes) != 0:
if sum(indexes) != len(string_)-1:
for index in indexes:
s.append(string_[:index+1])
s.append(string_[indexes[-1:][0]+1:])
combinisation.append(s)
else:
combinisation.append(string_)
return combinisation
string_ = '4824'
print "%s combinations:"%string_
print get_comb(string_)
string_ = '478952'
print "%s combinations:"%string_
print get_comb(string_)
string_ = '1234'
print "%s combinations:"%string_
print get_comb(string_)
>>
4824 combinations:
[['482', '4'], ['48', '24'], '4824', ['4', '482', '4'], ['4', '48', '24'], '4824
']
478952 combinations:
[['47895', '2'], ['4789', '52'], ['4789', '47895', '2'], ['478', '952'], ['478',
'47895', '2'], '478952', ['478', '4789', '47895', '2'], ['47', '8952'], '478952
', ['47', '4789', '52'], ['47', '4789', '47895', '2'], ['47', '478', '952'], ['4
7', '478', '47895', '2'], ['47', '478', '4789', '52'], ['47', '478', '4789', '47
895', '2'], ['4', '47895', '2'], ['4', '4789', '52'], ['4', '4789', '47895', '2'
], ['4', '478', '952'], ['4', '478', '47895', '2'], '478952', ['4', '478', '4789
', '47895', '2'], ['4', '47', '8952'], '478952', ['4', '47', '4789', '52'], ['4'
, '47', '4789', '47895', '2'], ['4', '47', '478', '952'], ['4', '47', '478', '47
895', '2'], ['4', '47', '478', '4789', '52'], ['4', '47', '478', '4789', '47895'
, '2']]
1234 combinations:
[['123', '4'], ['12', '34'], '1234', ['1', '123', '4'], ['1', '12', '34'], '1234
']

A normal code can be written like:
s=raw_input('enter the string:')
word=[]
for i in range(len(s)):
for j in range(i,len(s)):
word.append(s[i:j+1])
print word
print 'no of possible combinations:',len(word)
And output:
enter the string:
4824
['4', '48', '482', '4824', '8', '82', '824', '2', '24', '4']
no of possible combinations:10

Delete blank entries in CSV file Python

I need to read in a CSV file, from Excel, whose rows may be an arbitrary length.
The problem is the python retains these blank entries, but need to delete them for a future algorithm. Below is the output, I don't want the blank entries.
['5', '1', '5', '10', '4', '']
['3', '1', '5', '10', '2', '']
['6', '1', '5', '10', '5', '2']
['9', '10', '5', '10', '7', '']
['8', '5', '5', '10', '7', '']
['1', '1', '5', '10', '', '']
['2', '1', '5', '10', '1', '']
['7', '1', '5', '10', '6', '4']
['4', '1', '5', '10', '3', '1']

Here's a list comprehension integrated with the csv library:
import csv
with open('input.csv') as in_file:
reader = csv.reader(in_file)
result = [[item for item in row if item != ''] for row in reader]
print result

This is about as verbose a function as I could write to do what you want. There are certainly slicker ways.
def remove_blanks(a_list):
new_list = []
for item in a_list:
if item != "":
new_list.append(item)
return new_list

List comprehension version:
a = ['5', '1', '5', '10', '4', '']
[x for x in a if x != '']
Out[19]: ['5', '1', '5', '10', '4']
You may be better served by filtering at the csv read step instead.

Integer Manipulation of arrays python

I have 2 arrays and I need to switch the last digit of the integers in one array with the integers in another. Its better if I show you the output to get a better understanding of what I'm trying to do. I'm not sure this is even possible to do at the least.
Output of arrays:
first_array=['3', '4', '5', '2', '0', '0', '1', '7']
second_array=['527', '61', '397', '100', '97', '18', '45', '1']
What it then look like:
first_array=['3', '4', '5', '2', '0', '0', '1', '7']
second_array =['523', '64', '395', '102', '90', '10', '41', '7']

>>> [s[:-1]+f for (f,s) in zip(first_array, second_array)]
['523', '64', '395', '102', '90', '10', '41', '7']

If it is actual integers, you could try "rounding down" each element of the second list to nearest multiple of 10, then adding each element from the first list. For example:
>>> first = [3,4,5,6]
>>> second = [235,123,789,9021]
>>> second = [x - (x%10) for x in second]
>>> second
[230, 120, 780, 9020]
>>> [x + y for (x,y) in zip(first, second)]
[233, 124, 785, 9026]

Develop Reference

Python is a programming language that lets you work quickly and integrate systems more effectively.

How to remove letters and space from line in python? - python

Perhaps you meant to use the isdigit() method of strings: Index = ['TH', '1', '1', '2', '28', '29', '2', ''] Index1 = [x for x in Index if x] #to remove empty space Index2 = [x for x in Index1 if x.isdigit()] #to remove all letters Index3 = map(int, Index2) print (max(Index3)) Output: 29

Index = ['TH', '1', '1', '2', '28', '29', '2', ''] my_index = [int(elem) for elem in Index if elem.isnumeric()]

Index = ['TH', '1', '1', '2', '28', '29', '2', ''] print(max(map(int,filter(str.isdigit,Index)))) Filter out anything not a digit, filter(str.isdigit,Index) then map all to integers, map(int,) Then get the max. max()

Index = ['TH', '1', '1', '2', '28', '29', '2', ''] result = [i for i in Index if not(not i or i.isalpha())] print(result) Output:- ['1', '1', '2', '28', '29', '2']

Related

Python: How to convert a set of strings that contains commas represented as an element of a list to a sub-list?

Split list on smaller lists with equal elements

python all combinations of subsets of a string

Delete blank entries in CSV file Python

Integer Manipulation of arrays python

Categories

Resources