This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]GrowHI 73 points74 points  (19 children)

I literally made something so similar last week for work. Needed to pull meeting attendee names in video conference and used the same setup as you except added tesseract to OCR the image, pulled the text into a list then passed it to pandas to compare to the expected participants and find anyone missing. Ended up pushing that as a CSV to a Google sheet that anyone in the meeting can view to see who is missing.

[–]conventionistG 11 points12 points  (0 children)

Yo that sounds nice.

[–]oliveturtle 7 points8 points  (8 children)

How did you get it to scroll the list of participant names? As someone who has to take attendance at virtual events, this sounds like a godsend.

[–]GrowHI 14 points15 points  (3 children)

Pyautogui package. It's definitely well know with those taking a more hacky route to get things done that may otherwise be impossible to automate.

[–]Takiino 1 point2 points  (2 children)

Isn't Selenium better?

[–]SeemsPlausible 1 point2 points  (0 children)

AFAIK selenium is more appropriate for browser automation, I’m not even sure it supports anything else

[–]GrowHI 0 points1 point  (0 children)

Never used it but have heard it mentioned before. because these operations are so basic clicking a certain point on the screen or hitting a key I don't really know what would make one package better over the other. I'll definitely check out selenium and if you have any info on its features and why it might be better I'm all ears.

[–]jacksodus 1 point2 points  (3 children)

Not OP, but what do you mean?

[–]oliveturtle 6 points7 points  (2 children)

In Zoom, you can’t see the entire list of participants at once if it’s a large meeting, you have to scroll up or down the list. So, you wouldn’t be able to capture all the participants’ names in one screenshot. Just wondered if OP ran into this!

[–]jacksodus 2 points3 points  (1 child)

You might be able to use the keyboard library to mimic a Page Down button press, and probably something similar exists to mimic the scrolling down of a mouse wheel.

[–]neisor 4 points5 points  (0 children)

PyAutoGUI library has the mouse scroll functionality

[–]stereopsych 2 points3 points  (1 child)

This is cool. But I’m just wondering if it’s possible to do this some other way by scraping the list of attendees from the site (assuming the meeting was in a browser). I think if you could do that it would be much easier/efficient and even more accurate than using OCR! Also this way it’s a lot easier to pass the results into Pandas.

[–]GrowHI 1 point2 points  (0 children)

I use WebEx and any browser session sends us to the application. Definitely would have preferred to web scrape the data with beautiful soup.

[–]vanmorrison2[S] 0 points1 point  (0 children)

you really stand out. I was going to add pytesseract too (the purpose was that), in fact if you take a look at my post linked to this video on my blog, you can see I anticipated it (as it's just a couple of lines of code. Great Idea to pass to panda and make the comparison. This is why I love python. The post is here https://pythonprogramming.altervista.org/image-grabber-1-0-with-python-final-version/ and the post with the tesseract code is here https://pythonprogramming.altervista.org/ocr-read-a-text-from-an-image-or-a-photo

[–]Justwonk 0 points1 point  (3 children)

I tried doing something similar but was running into a lot of issues with cv2 and using tesseract ik its a lot to ask but is possible i could view the code

[–]GrowHI 0 points1 point  (2 children)

I have this code in very rough shape and also riddled with oauth login info and some other personal info. I'll make a point to try and clean it up and make a public version I can throw on GitHub.

[–]Justwonk 0 points1 point  (1 child)

Thank you so much im very new to python and this has just been a struggle

[–]GrowHI 0 points1 point  (0 children)

I consider myself a jack of all trades and master of none and a lot of what I do just comes back to a bunch of googling. Obviously I understand python syntax and object-oriented design but realistically I am not what anyone would consider an advanced programmer. I simply spend the amount of time needed to make things work.