I've been working on a program for hand pose recognition in Python. There is a class named "multi_hand_landmarks" which contains 21 3D hand-knuckle coordinates. Each coordinate looks like this:
(screenshot of one landmark's x, y, z values)
Now I want to calculate the coordinates of the midpoint between two joints, but I couldn't convert this data to any other type; that was my problem.
It's solved. I had been treating "landmark" as a list, but it actually isn't.
If you want to get the x-coordinate of a joint, you should write it like this:
hand_feature = hander.process(img)
for landmark in hand_feature.multi_hand_landmarks:
    x = landmark.landmark[0].x  # You can replace the "0" with any joint index you want
    print(x)
It's a real pain that we have to write "landmark.landmark[0].x" rather than "landmark[0].x"!
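For the midpoint calculation I originally wanted, here is a minimal sketch along the same lines (the joint indices 0 and 5 are just an example pair):

for landmark in hand_feature.multi_hand_landmarks:
    a = landmark.landmark[0]   # e.g. the wrist
    b = landmark.landmark[5]   # e.g. the base knuckle of the index finger
    mid_x = (a.x + b.x) / 2
    mid_y = (a.y + b.y) / 2
    mid_z = (a.z + b.z) / 2
    print(mid_x, mid_y, mid_z)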
I have found the contour of the object with cv2.findContours, and I want to know the coordinates where the contour changes slope relative to the points before it; I want that to be the starting point of the upper fin. I would really appreciate any suitable suggestion.
My input image: input image
Image after applying a threshold of 195: apply threshold
And I expect the result to look like this: result
Perhaps you could try something like https://scikit-image.org/docs/stable/auto_examples/edges/plot_skeleton.html; maybe the resulting skeleton has features indicating the location of the fin.
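A minimal sketch of that idea, assuming the image is loaded from a file (the filename is hypothetical; the 195 threshold is taken from your description):

import cv2
import numpy as np
from skimage.morphology import skeletonize

img = cv2.imread("fish.png", cv2.IMREAD_GRAYSCALE)   # hypothetical filename
_, binary = cv2.threshold(img, 195, 255, cv2.THRESH_BINARY)
skeleton = skeletonize(binary > 0)     # 1-pixel-wide skeleton of the shape
ys, xs = np.nonzero(skeleton)          # skeleton pixel coordinates to analyse

The branch points and endpoints of that skeleton (for example where the fin branch meets the body) are the kind of features you could then look for.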
I am trying to make an algorithm that's able to solve a big tileset in the tiling problem. Right now it's able to find the correct Tiles to place based on their width and height, but there are some problems with making it properly recursive.
As you can see, the idea is that after each tile is placed, the remaining field is split into a Field Right and a Field Below. The algorithm first tries to fill the Field Right, and as soon as that's done it has to start filling the Field Below.
The problem I have is that once Field Right is solved, the call has to be "sent back" somehow (I think using recursion, though this is quite complex) so that it goes back one tile and moves on to the Field Below belonging to that tile. I put the idea in some pseudocode to make it easier to follow.
As you can see, when FieldRightWidth is solved and FieldBelowHeight is also solved, I want it to return to the previous tile to check whether its FieldBelow is solved. I think that's where I need to put some code to make this work, but after hours of Googling I still have no clue.
Pseudocode:
def Methods:
    create global tileset
    create local tileset (without duplicates)
    if globaltiles is empty:
        solved
        end
    else:
        if fieldRightWidth == solved:
            if fieldBelowHeight == solved:
                return True???
            # FIELD BELOW
            else:
                Search for Tile
                Place Tile
                return Methods
        # FIELD RIGHT
        else:
            Search for Tile
            Place Tile
            return Methods
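For reference, here is a minimal sketch of how a return value can drive that backtracking; the function and helper names are hypothetical, the field is split into a field right and a field below as described above, and tiles are not rotated:

def fill_field(width, height, tiles):
    """Try to tile a width x height field.
    Returns the list of unused tiles on success, or None on failure."""
    if width == 0 or height == 0:
        return tiles                          # nothing left to fill here
    for i, (tw, th) in enumerate(tiles):
        if tw <= width and th <= height:
            rest = tiles[:i] + tiles[i + 1:]  # tile i is placed
            # First try to solve the field right of the placed tile...
            after_right = fill_field(width - tw, th, rest)
            if after_right is None:
                continue                      # field right failed: try the next tile
            # ...then the field below, using whatever tiles are left over.
            after_below = fill_field(width, height - th, after_right)
            if after_below is None:
                continue                      # field below failed: backtrack
            return after_below                # both sub-fields solved
    return None                               # nothing fits: the caller backtracks

# Example: a 4x4 field and a few (width, height) tiles
print(fill_field(4, 4, [(2, 4), (2, 2), (2, 2)]) is not None)   # -> True

Returning None (instead of just False) lets the caller know which tiles are still available for the field below, which is the "send back" step I was stuck on.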
And a picture of what I want the algorithm to do:
And all of the code:
http://pastebin.com/8t4PeiZP
http://www.filedropper.com/tilingnew
I'm still a newbie in coding, so any advice or help is very appreciated!
Alright, let's assume the area you want to fill is square or rectangular (not rotated); it starts at a minimum [x, y] and ends at a maximum [x, y], like so:
SMaxX = 5
SMinX = 0
SMaxY = 5
SMinY = 0
Or, if you are familiar with 2D vectors, you can write it more compactly like so:
S = [5,5]
You might already know about 2D vectors; just in case, here is what a vector in 2D Cartesian coordinates means:
S = [5,5] means that if S starts at [0,0], it ends at [5,5] (simpler, right?).
The boxes can be described the same way:
#space each box taking
box1 = [3,3]
box2 = [2,2]
box3 = [1,1]
And since each box has a priority, let's say:
#priority of each box
box1 = 6
box2 = 4
box3 = 2
We can merge both space and priority into a dictionary like so:
#Items
dic = {'box1': {'space': [3,3], 'priority': 6},
       'box2': {'space': [2,2], 'priority': 4},
       'box3': {'space': [1,1], 'priority': 2}}
Having a priority and a space for each box, this looks like the knapsack problem.
If you are familiar with the knapsack problem, you build a table to find the highest total priority that fills the space best, in other words the best possible way of fitting the boxes. Check this link1 and link2.
However, the standard knapsack algorithm is a 1D solution; if you run it here you get 10, i.e. Box1 and Box2. Since your problem is 2D, with different heights and widths, the standard 1D formula won't work directly; you would need to look into a 2D formulation or ask around to see if someone has done that before.
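A minimal sketch of that 1D version, just to reproduce the 10 mentioned above (each box's width is used as its "weight" and the 5-unit side of S as the capacity):

def knapsack(capacity, items):
    """items: list of (size, priority) pairs. Returns the best total priority."""
    best = [0] * (capacity + 1)
    for size, priority in items:
        # iterate capacities downward so each item is used at most once
        for c in range(capacity, size - 1, -1):
            best[c] = max(best[c], best[c - size] + priority)
    return best[capacity]

print(knapsack(5, [(3, 6), (2, 4), (1, 2)]))   # -> 10 (box1 + box2)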
Other than the knapsack algorithm, you can try a flood-fill-style approach, which is a bit slower if you have a huge area, but works just like the game Tetris does.
You set a standard cell size like 1x1, describe the whole area as 1x1 cells stored in a variable, and mark each cell True (boolean). Then, starting with the higher-priority boxes, you fill the area and set the covered 1x1 cells to False. After that it's really easy to check how many cells are still True and which area the boxes are taking.
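A minimal sketch of that grid idea, using the 5x5 area and the box sizes from above (the place helper is just an illustration):

# Mark the whole 5x5 area as free (True); placed boxes flip cells to False.
W, H = 5, 5
free = [[True] * W for _ in range(H)]

def place(box_w, box_h, x, y):
    """Place a box with its corner at (x, y) if every cell it covers is free."""
    if x + box_w > W or y + box_h > H:
        return False
    cells = [(x + i, y + j) for i in range(box_w) for j in range(box_h)]
    if not all(free[cy][cx] for cx, cy in cells):
        return False
    for cx, cy in cells:
        free[cy][cx] = False
    return True

place(3, 3, 0, 0)                                      # box1 in a corner
used = sum(not cell for row in free for cell in row)   # cells taken so far
print(used)                                            # -> 9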
Anyway, I'm trying to figure out the same thing for irregular shapes, so that is all I have found out; I hope it helps you.
(Check this link as well, I got some useful answers there.)
Edit: okay, if you combine the Tetris idea of defining the area with the knapsack algorithm along one axis, and then, based on that area, run the knapsack algorithm again along the other axis, it should work perfectly.
At my job, we use a dedicated MS ACCESS program (let's call it XYZ) to help us with our work.
I cannot access the source code/API of this program, but I want to write a Python script to automate a sorting task (which the program does not implement), described below.
We take 12 random objects/tools/pieces, each identified by a barcode with a unique id (for example 50286, 50285, 50277, 50280 ....); we scan them with a barcode gun and in the program XYZ we get this result.
before sorting
This is a screenshot from the PC at work; I added the blue numbers on the right by hand, just for clarity, to explain what I'm trying to accomplish.
Remember that we took these objects randomly; now we have to sort them.
They are sorted by program XYZ according to some sorting rules that aren't important here.
My script takes two screenshots, the first before sorting the list and the second after sorting it.
After this sorting, my list looks like this.
after sorting
I want my script to output the numbers 3,12,11,9,8,7,10,6,4,5,1,2.
I thought this was a simple task because I already managed to get, for example, Element_#1_in_before_list
50826 before
and Element_#11_in_after_list,
50286 after
However, I cannot tell that the first item is now the eleventh element, because the two pictures aren't identical due to random, annoying noisy blue/cyan pixels (TrueType anti-aliasing?).
I tried OCR to recognize the characters, but it sometimes fails and it's too complicated.
I tried converting to black and white, but the noise pixels sometimes come out black and sometimes white, so the two images don't match perfectly (my idea was to compare md5sums to tell whether they are the same).
How can I solve the problem?
Maybe it's simple, but I'm a noob.
Help me surprise the XYZ developer!!
Have you tried matching in a non-exact way? You could match each image in list 1 to the image in list 2 with the lowest MSE.
import numpy as np

def mse(imageA, imageB):
    # the 'Mean Squared Error' between the two images is the
    # sum of the squared differences between the two images;
    # NOTE: the two images must have the same dimensions
    err = np.sum((imageA.astype("float") - imageB.astype("float")) ** 2)
    err /= float(imageA.shape[0] * imageA.shape[1])
    # return the MSE; the lower the error, the more "similar"
    # the two images are
    return err
Source for MSE function: http://www.pyimagesearch.com/2014/09/15/python-compare-two-images/
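A minimal sketch of the matching step, assuming you have already cropped each row of the two screenshots into two lists of equally sized NumPy arrays (before_rows and after_rows are hypothetical names):

import numpy as np

def original_positions(before_rows, after_rows):
    """For each row image in the sorted (after) screenshot, return the 1-based
    index of the most similar row in the unsorted (before) screenshot."""
    positions = []
    for img_after in after_rows:
        errors = [mse(img_after, img_before) for img_before in before_rows]
        positions.append(int(np.argmin(errors)) + 1)
    return positions

print(original_positions(before_rows, after_rows))   # e.g. 3, 12, 11, ...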
I attach a zip archive with all the files needed to illustrate and reproduce the problem.
(I don't have permissions to upload images yet...)
I have an image (test2.png in the zip archive) with curved lines.
I try to warp it so the lines are straight.
I thought of using scikit-image transform, and in particular transform.PolynomialTransform because the transformation involves high order distortions.
So first I measure the precise position of each line at regular intervals in x to define the input interest points (in the file source_test2.csv).
Then I compute the corresponding desired positions, located along a straight line (in the file destination_test2.csv).
The figure correspondence.png shows what this looks like.
Next, I simply call transform.PolynomialTransform() using a polynomial of order 3.
It finds a solution, but when I apply it using transform.warp(), the result is crazy, as illustrated in the file Crazy_Warped.png
Can anybody tell me what I am doing wrong?
I tried a polynomial of order 2 without luck...
I managed to get a good transformation for a sub-image (the first 400 columns only).
Is transform.PolynomialTransform() completely unstable in a case like mine?
Here is the entire code:
import numpy as np
import matplotlib.pyplot as plt
import asciitable
import matplotlib.pylab as pylab
from skimage import io, transform

# read image
orig = io.imread("test2.png", as_grey=True)

# read tables with reference points and their desired transformed positions
source = asciitable.read("source_test2.csv")
destination = asciitable.read("destination_test2.csv")

# format as numpy.arrays as required by scikit-image
# (need to add 1 because I started to count positions from 0...)
source = np.column_stack((source["x"] + 1, source["y"] + 1))
destination = np.column_stack((destination["x"] + 1, destination["y"] + 1))

# Plot
plt.imshow(orig, cmap='gray', interpolation='nearest')
plt.plot(source[:, 0], source[:, 1], '+r')
plt.plot(destination[:, 0], destination[:, 1], '+b')
plt.xlim(0, orig.shape[1])
plt.ylim(0, orig.shape[0])

# Compute the transformation
t = transform.PolynomialTransform()
t.estimate(destination, source, 3)

# Warping the image
img_warped = transform.warp(orig, t, order=2, mode='constant', cval=float('nan'))

# Show the result
plt.imshow(img_warped, cmap='gray', interpolation='nearest')
plt.plot(source[:, 0], source[:, 1], '+r')
plt.plot(destination[:, 0], destination[:, 1], '+b')
plt.xlim(0, img_warped.shape[1])
plt.ylim(0, img_warped.shape[0])

# Save as a file
io.imsave("warped.png", img_warped)
Thanks in advance!
There are a couple of things wrong here, mainly they have to do with coordinate conventions. For example, if we examine the code where you plot the original image, and then put the clicked point on top of it:
plt.imshow(orig, cmap='gray', interpolation='nearest')
plt.plot(source[:,0],source[:,1],'+r')
plt.xlim(0,orig.shape[1])
plt.ylim(0,orig.shape[0])
(I've taken out the destination points to make it cleaner) then we get the following image:
As you can see, the y-axis is flipped. If we invert the y-axis with:
source[:,1] = orig.shape[0] - source[:,1]
before plotting, then we get the following:
So that is the first problem (don't forget to invert the destination points as well). The second has to do with the transform itself:
t.estimate(destination,source,3)
From the documentation we see that the call takes the source points first, then the destination points. So the order of those arguments should be flipped.
Lastly, the clicked points are of the form (x,y), but the image is stored as (y,x), so we have to transpose the image before applying the transform and then transpose back again:
img_warped = transform.warp(orig.transpose(), t, order=2, mode='constant',cval=float('nan'))
img_warped = img_warped.transpose()
When you make these changes, you get the following warped image:
The lines aren't perfectly flat, but the result makes much more sense.
Thank you very much for the detailed answer! I cannot believe I did not see the axis inversion problem... Thanks for catching it!
But I am afraid your final solution does not solve my problem... The image you get is still crazy. It should be continuous and not have such big holes and weird distortions... (see the final solution below)
I found I could get a reasonable solution using RANSAC:
from skimage.measure import ransac

t, inliers = ransac((destination, source), transform.PolynomialTransform,
                    min_samples=20, residual_threshold=1.0, max_trials=1000)
outliers = (inliers == False)
I then get the following result
Note that I think I was right to use (destination, source) in that order! I think it has to do with the fact that transform.warp requires the inverse map as input for the transformation object, not the forward map. But maybe I am wrong? The good result I am getting suggests it's correct.
I guess polynomial transforms are just too unstable, and using RANSAC makes it possible to get a reasonable solution.
My problem was then to find a way to change the polynomial order in the RANSAC call...
transform.PolynomialTransform() does not take any parameters, and by default uses a 2nd-order polynomial, but from the result I can see I would need a 3rd- or 4th-order polynomial.
So I opened a new question, and got a solution from Stefan van der Walt. Follow the link to see how to do it.
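In short, the idea is roughly a small wrapper class that fixes the polynomial order, something like this sketch (assuming scikit-image's estimate(src, dst, order) signature for PolynomialTransform):

from skimage import transform
from skimage.measure import ransac

class PolynomialTransformOrder3(transform.PolynomialTransform):
    """PolynomialTransform whose estimate() always uses a 3rd-order polynomial."""
    def estimate(self, src, dst):
        return super().estimate(src, dst, order=3)

t, inliers = ransac((destination, source), PolynomialTransformOrder3,
                    min_samples=20, residual_threshold=1.0, max_trials=1000)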
Thanks again for your help!
I am still struggling with this, guys; I have already tried the suggestion in this post. I think my brain isn't working well, so I really need a dumbed-down answer:
So I have data in the format of a matrix, i.e.:
[x1,y1,val]
[x1,y2,val]
[x1,y3,val]
[x1,y4,val]
[x1,y5,val]
..........
[x2,y1,val]
[x2,y2,val]
[x2,y3,val]
[x2,y4,val]
[x2,y5,val]
where val is some number. Basically I want to plot a contour plot from this data; I have tried a number of examples as a starting point but am still unsure how to proceed. I want something of this kind:
http://www.astro.ex.ac.uk/people/mbate/Animations/Stellar/pgplot_0259_outflow.png
where the x and y are the RA and DEC. Thanks!
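For reference, here is a minimal sketch of one way to do it with matplotlib, assuming the three columns are loaded into NumPy arrays and that the rows form a complete regular grid ordered by x then y as shown above (the filename is hypothetical):

import numpy as np
import matplotlib.pyplot as plt

# plain three-column text file: x (e.g. RA), y (e.g. DEC), val
data = np.loadtxt("data.txt")          # hypothetical filename
x, y, val = data[:, 0], data[:, 1], data[:, 2]

# Reshape the flat columns into a 2D grid.
xu, yu = np.unique(x), np.unique(y)
V = val.reshape(len(xu), len(yu))      # rows follow x, columns follow y

plt.contourf(xu, yu, V.T, levels=20)   # transpose so the axes line up
plt.xlabel("RA")
plt.ylabel("DEC")
plt.colorbar(label="val")
plt.show()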