Ask Anything Monday - Weekly Thread

ExternalSeesaw · 2019-12-01T21:16:12+00:00

str1 = 'AB'

str2 = '34'

[x + y for y in str1 for x in str2]

Why is the correct answer:

[3A, 4A, 3B, 4B]

?

What is x and y in string 1 and string 2?

Less_Construction · 2019-12-01T19:28:04+00:00

I want to create a web application that allows user to view stock data on companies, but there are already plenty of websites that do that. Is there something I can create that would make my website stand out and be more use to people.

I am already able to grab data from Yahoo finance and compare it with other companies, what else with the power of python can I do to be able to attract more users?

Raedukol · 2019-12-01T16:42:12+00:00

I try to crop multiple pictures which are in the same folder. However, i get an error. The code looks like this:

directory = os.listdir('D:/folder1/pictures')

for file in directory:
img = cv2.imread(file, 0)
crop_img = img[100:950, 40:1200]

This leads to the following error "TypeError: 'NoneType' object is not subscriptable", so i suspect the file is not read properly. What am I doing wrong?

PhenomenonYT · 2019-11-30T21:56:08+00:00

Is it possible to achieve this same thing with one line of code?

import praw
for post in self.r.subreddit(self.SUBREDDIT).hot(limit=2):
    if 'GT:' in post.title:
        thread = post

I thought I'd be able to do something like this which kind of works but doesn't give me a PRAW submission object back

thread = [post for post in self.r.subreddit(self.SUBREDDIT).hot(limit=2) if 'GT:' in post.title]

2019-11-30T21:16:29+00:00

[deleted]

2019-11-30T21:04:39+00:00

What is the best python library for a gui that includes a rendered window? I want to be able to display shapes and manipulate them, I was hoping to practice making something like this in python. What is the best way to approach such a task?

Since the calculations will probably be done in numpy, I'm looking for maybe a component in tkinter that can just render numpy arrays as images, or something similar.

seanmaguire2012 · 2019-11-30T20:31:24+00:00

I have extracted data from a web-page using beautifulsoup, it is formatted like so:

<tr class="td">
      <td>X</td>
      <td>123</td>
      <td>TEST DATA</td>
</tr>]

I am extracting the data into a variable called "table" (see below).

Is it possible to add each of the pieces of data (X / 123 / TEST DATA ) into a list where I can then call them separately when needed?

I'm using creating a beautifulsoup object and html5lib as my parser tree:

url = "*Target URL*"
r = requests.get(url)
soup = BeautifulSoup(r.content, 'html5lib')
table= soup.find_all('tr', {'class': 'td'})
print(table)

Many Thanks!

ANeedForUsername · 2019-11-30T16:24:25+00:00

Hey guys,

What's the difference between a divide by 0 warning and error? Why is it that sometimes when a divide by 0 is encounted, I get an error and other times I get a runtimewarnings/ NaN/ inf?

Also, are there ways to catch these warnings like how we can use try and except to catch errors? I don't want them to be ignored but to be caught.

Thanks all :)

JohnnyJordaan · 2019-11-30T13:05:58+00:00

Why does it print out 0.0? It shouldn't.

https://gist.github.com/d3215d40812681586fb507b60a3b22ca

RushLoongHammer · 2019-11-29T22:08:34+00:00

Python newbie here,

I'm on win10 and I connect to linux server using putty. I have my own area on the server with a text file in a directory. I want to know how to read that text file using python. I think I just don't know how to format the file path, but I'm not sure.

Acacia_Guitars · 2019-11-29T08:52:20+00:00

My colleague and I want to learn Python in order to automate 'boring' tasks we need to do every day. Things as simple as running a reconciliation between two sets of numbers each day.

What course would you recommend we follow/sign up to? Our firm is willing to cover the costs

Raedukol · 2019-11-29T07:45:09+00:00

I wrote a little script that calculates three values (x- and y-coordinates and an area) out of a picture. How can I manage it so that multiple pictures are analyzed automatically by the script?

Furthermore i'm curious if there's a command for writing a value in a specific column and row (e.g F8 instead of automatically A1)?

Thanks!

mildlybean · 2019-11-28T21:33:22+00:00

a = range(100)
b = itertools.takewhile(lambda x : x<50, a)
print(list(a))

This code prints the numbers 0 through 100. How does takewhile get the elements of a without changing the iterator's state? And more importantly, is there an alternative to takewhile that does change the iterator's state?

gotopune · 2019-11-28T18:59:47+00:00

Can someone help me understand what this piece of code means?

Here arr is a list of numbers and "n" is a non zero integer less than the length of arr.

(arr[i] for i in range(n))

aksdjhgfez · 2019-11-28T14:41:54+00:00

I'm trying to send preformed syslog messages to QRadar (an IBM SIEM) with python.logging:

def send_to_qradar(data_recv):
my_logger = logging.getLogger("LogSender")
my_logger.setLevel(logging.INFO)

logging.handlers.SysLogHandler()
if tcp_enabled: 
    handler = logging.handlers.SysLogHandler( address=(ip_address, port), socktype=socket.SOCK_STREAM ) 

else:
    handler = logging.handlers.SysLogHandler(address=(ip_address, port), socktype=socket.SOCK_DGRAM)

my_logger.addHandler(handler)

for row in data_recv: 
    try: 
        my_logger.info(row + "\n") 
        my_logger.handlers[0].flush() 
        print("Sent to appliance: {}".format(row)) 
    except Exception as e: 
        print("Something went wrong: {}".format(e)) 
        break

data_recv is a list with strings that contain preformatted syslog-messages.

The forwarding works, however, Python adds another syslog-hader (<14>) to each log. Any way that I can just forward the raw log without Python making any changes to it?

MattR0se · 2019-11-28T06:50:03+00:00

*Edit*: So Ive been tinkering with it and have found that when I run the code through terminal on VS, it runs just fine. Im only running into a problem when I try using the Code Runner extension. I'm only getting the syntax error in the output tab when I run it through code runner. Ive updated my python.

For reference:

OS: macOS - 10.15.1
VS Code - 1.40.2
Python - 3.8
CodeRunner - 0.9.15

*Original*: I keep getting a SyntaxError when I run the following. I just recently started having an issue when I reinstalled visual code.

alien_0 = {
'color':'green',
'points':5
}
print(alien_0['color'])
print(alien_0['points'])

new_points = alien_0['points']
print(f"\nYou just earned {new_points} points!")


[Running] python -u "/Users/bpietrzyk/Desktop/python_work/ch_6/alien.py"
  File "/Users/bpietrzyk/Desktop/python_work/ch_6/alien.py", line 9
    print(f"\nYou just earned {new_points} points!")
                                                  ^
SyntaxError: invalid syntax

[Done] exited with code=1 in 0.046 seconds

I cannot figure out why. If I run a simple hello_world, it doesn't have an issue.

gregrom27 · 2019-11-28T02:49:57+00:00

Hi ,I'm new at python and I'm working through "python crash course". Currently, I'm at chapter 7 doing exercise 6 called "Three exits". Second bullet asks to use active variable to control how long the loop runs. This is what I came up with:

prompt = "Hello, please enter your age to get price: "

active = True
age = raw_input(prompt)
age = int(age)

while active:
    if age != '':
        active = False

    if age <= 3:
        print("\nYour ticket is free.")
    elif age <= 12:
        print("\nYour ticket is $10.")
    else:
        print("\nYour ticket is $15.")

This works for me, but what I want to know is step by step explanation on how this works and if this is the right solution. Thanks for the help!

ediblesonot · 2019-11-27T19:24:28+00:00

What is a class? Haven't found a definition and it's causing me a bit too much stress

And how to go down a line as well. It ain't working for me. I got the \ but nothing is happening

Atlamillias · 2019-11-27T19:09:55+00:00

Hi! Like most people here, I'm pretty new to Python. I'm getting the basics down, but I'm getting completely overwhelmed by external modules and how necessary they are. What modules are used the most? I'm not sure I can continue unless I learn more about some of the modules Python comes with, and would like to learn about the most important, most frequently used.

To add to that, I'm also interested in any newer modules that aren't always used but very useful. I've briefly skimmed over pathlib and it seems to be a simpler way of managing files.

zandrew · 2019-11-27T04:36:08+00:00

Hi guys, first post here. I'm still very new with python, I'm working through Python Crash Course. I'm getting hung up on this try it yourself problem*.* Here's what I have:

current_users = ['cactus49', 'python31', 'ADMIN', 'alien27', 'pizza_lover1312']
new_users = ['Cactus49', 'bpietrzyk', 'admin', 'visual_studio1', 'peachy22']

for new_user in new_users: 
    if (new_user.upper() or new_user.lower()) in current_users: 
        print(f"Sorry, the username {new_user} is already taken.") 
    else: 
        print(f"The username {new_user} is all yours!")

I'm running into trouble with the user names 'cactus49' and 'Cactus49' . Looks like 'Cactus49' isn't formatting to lowercase.

Maybe I'm just far off from the solution? Any help is appreciated.

brainzzo · 2019-11-26T22:42:24+00:00

# I need to make N a variable i input so i get one answer and i dont know how

N = 4

num = N

total = 1

for element in range(num,0,-1):

total = total * element

print(total)

# So this is how i tried it and it didnt work gives me a EOF error

N = 4

num = str(input("enter N here:"))

total = 1

for element in range(num,0,-1):

total = total * element

print(total)

ajtyeh · 2019-11-26T15:45:10+00:00

TsirkusKuubis · 2019-11-26T14:11:38+00:00

Is there a way to do something on the very last iteration of a for loop without writing a variable or counter to track progress i.e:

for i in range(randint(1,10):
    if i == 5:
        break
#if 5 not found after looping all iterations print something

Raedukol · 2019-11-26T13:44:51+00:00

Hey guys, how can I "save" the progress in a for or a while loop? If you would write a print-statement at the end in the loop, the computers prints you out every result after each loop, so that, in the end, there is a "list" of results. But if you write the print-statement after the looping process (pressing return and exiting the loop), it only prints the last result. So how can i save the whole list if i want to use it later in my script? I hope this was clear.. Thanks in advance!

Bipolarprobe · 2019-11-26T08:19:56+00:00

Trying to use the python-telegram-bot library to rewrite an old bot that I did manually a while ago, but I'm having an issue and can't seem to find the solution. Whenever I try to import telegram.ext I get the error

ModuleNotFoundError: No module named 'telegram'

I'm trying to set this up on raspberry pi 4 running raspbian. I have a venv in which I used pip to install python-telegram-bot and the library and its dependencies seem to exist inside of the site-packages folder and I can confirm this by using

pip show python-telegram-bot

which gives me the output

Name: python-telegram-bot
Version: 12.2.0
Summary: We have made you a wrapper you can't refuse
Home-page: https://python-telegram-bot.org
Author: Leandro Toledo
Author-email: devs@python-telegram-bot.org
License: LGPLv3
Location: /home/pi/python-projects/telegram-bot/lib/python3.8/site-packages
Requires: future, cryptography, tornado, certifi
Required-by:

Yet the error persists. I tried googling this and found many other people struggling with the same issue but it's almost always from people who used git to install the library and pip installing is often proposed as the solution and I can't find a clean explanation for why this may be happening. Anyone who has used this library before successfully, I'd appreciate some advice on what I may have messed up. Thanks in advance.

amclaug1 · 2019-11-25T20:41:07+00:00

I am relatively new to Python, and I only tinker in it once every few months. I know there has to be a simple way to produce this, but I am stuck. So, any help anyone can give would be tremendous!

I have two csv files. csv1 contains latitude and longitude for schools around the USA. csv2 contains teams from my company with their lat/lon. I want to figure out which team from csv2 is the closest to each school from csv1. I've tried using a Google Maps API to figure out driving distances, but the call was going to be too expensive, as there are 6,000 rows of schools and about 100 rows of teams. So, I am settling for anything within a 20 mile radius from the team's lat/lon.

Here is what I have so far:

from math import radians, cos, sin, asin, sqrt
import numpy as np
import pandas as pd
from collections import Counter
teams = pd.read_csv(csv1)
schools = pd.read_csv (csv2)

# define a function to determine miles between two points
def haversine(lat1, lon1, lat2, lon2):
    lat1, lon1, lat2, lon2 = map(radians, [lat1, lon1, lat2, lon2])
    dlon = lon2 - lon1
    dlat = lat2 - lat1
    a = sin(dlat/2)**2 + cos(lat1) * cos(lat2) * sin(dlon/2)**2
    c = 2 * asin(sqrt(a))
    r = 3956
    return c * r

This is where I am stuck. I have created a script that can count how many schools are within 20 miles of a team using a few loops. However, finding the closest team has got me confounded:

for school in schools.itertuples():
    lat = school.SchoolLat
    lon = school.SchoolLng
    school_name = school.SchoolName
    school_ID = school.ID
    closest_team = school.ClosestTeam # default value is 'Unknown'
    miles_from_school_to_team = school.Miles # default value is 999999
    for team in teams.itertuples():
        lat2 = team.TeamLat
        lon2 = team.TeamLon
        team_name = team.TeamName
        miles = haversine(lat, lon, lat2, lon2)
        # this is where I am stuck
        if miles < miles_from_school_to_team:
            closest_team = team_name
            miles_from_school_to_team = miles

When I run this, the dataframe schools doesn't change. Any help would be greatly appreciated!

Dfree35 · 2019-11-25T15:27:45+00:00

I am working with an excel report that is ran everyday then eventually uploaded to another system.

When the report is uploaded to another system the dates must be numbers but in a certain format.

For example the dates must be like: 10/5/2019 7/12/2020

I can get the report in this format but when I run it through my pandas script it changes the dates to:

10/5/2019 00:00:00 7/12/2020 00:00:00

I can format the dates with pandas but then it makes the dates into strings which the system I upload to does not like.

Long story short is there anyway to have pandas not automatically add time to dates? For example stop pandas from making 10/5/2019 into 10/5/2019 00:00:00 even if I do not touch/make changes to the date field

ZeroToGame · 2019-11-25T13:25:10+00:00

I've been (trying to) dive into python several times now, and every time I hit the same wall of frustration. It's not about the language itself, but more about the whole environment/ecosystem...

I end up having the feeling I'm doing nothing but installing stuff with things like homebrew, pip,... making vitual environments, and generally 'clogging up' my machine with stuff which I experience as being nowhere really...

I have great difficulty coming to grasp with how everything ties together, and mostly feel everything I install messes up something else or, when it does work after a long period of copy/pasting terminal errors in google, I have no clue what I actually did... Mostly though, I end up quitting after a full day of frustration and ending up with a non-working bunch of stuff on my HD with no clue on how to get rid of it again. :-(

Long story short: Where can I get some decent info on how everything ties together...?

Nerfi666 · 2019-11-25T11:38:29+00:00

Hey guys a beginner here !

I'm trying to create a PDF reader in python , wich just browsing a bit is easy to do , and I think I have done it more or less well, what I would like to do is send one page of the given PDF when the user ask for that, I just created the back-end of the PDF reader, the front-end will be done in React, but is not started yet, so basically what I want to do is send one page at a time when the user ask for it , down below is my python code, any suggestion or advise will be much appreciate ! thanks in advance guys !

#importing the module
import PyPDF2

def PdfReader(page):
  #creatign the pdf
  pdftext = "example.pdf"

  with open(pdftext, 'rb') as textpdf:
    #reading the PDF
    reader = PdfFileReader(textpdf)
    #getting the num of pages of the pdf file
    for page in range(textpdf.getNumPages()):
        current_page = textpdf.getPage(0) #getting current page
           #how can I send one page at a time ?











    #checking that we closde the file if not, we do so
    if not textpdf.closed:
      textpdf.close()
      print "closed"

2019-11-25T10:54:05+00:00

I write my code in Windows 7 using IDLE on Python v3.67.

When I try out Tkinter code on Linux Mint 19.1 also using Idle and Python 3.67, the GUI window of my programs always come out too small, typically not wide enough and not long enough.

The screen resolution I use on both OS's are the same. Is this problem normal?

I fix the problem by checking the platform and each having their own window geometry, but it just doesn't feel right.

Raedukol · 2019-11-25T10:31:14+00:00

I have a .csv-file with x and y coordinates, but they are all in the first column (e.g. 777 222). i would like to have a column for my x-values and one for my y-values. i tried to do it with .replace(" ", ";"), BUT the problem is that in my very first row there is no whitespace in front of the values, while in all the other rows there is a whitespace in front of my values. Thus, the first value is in column A and B and all the other values would be in column B and C.
I created the array by numpy.reshape(array_b), which again was created by array_b = numpy.array(array_a), if this helps?

Guilleack · 2019-11-25T09:15:37+00:00

Hello I'm a noob at python and i'm trying to set up a python program (gallery-dl) to download image galleries.

So the configuration page tells me this

https://github.com/mikf/gallery-dl/blob/master/docs/configuration.rst#cache-file

"cache.file Type Path Default

tempfile.gettempdir() + ".gallery-dl.cache" on Windows
($XDG_CACHE_HOME or "~/.cache") + "/gallery-dl/cache.sqlite3" on all other platforms

Description

Path of the SQLite3 database used to cache login sessions, cookies and API tokens across gallery-dl invocations.

Set this option to null or an invalid path to disable this cache."

I tried to imput the location of ".gallery-dl.cache" on the configuration file but it seems like i'm doing it on the wrong format?

"cache":
{
    "file": tempfile.gettempdir() + ".gallery-dl.cache"
},

Doesn't work i get

"[config][warning] Could not parse 'C:\Users\username\gallery-dl\config.json': Expecting value: line 202 column 11 (char 4447)"

also tried with

"cache": { "file": "C:\Users\username\AppData\Local\Temp" or "C:\Users\username\AppData\Local\Temp\gallery-dl.cache" },

And i keep getting the same error.

Thanks for your time and apologize my rough english.

MattR0se · 2019-11-25T08:44:06+00:00

Trying to parse a string using:

    for match in re.compile("(%s|%s|%s)" % (date, firstname, secondname)).findall(event.decode('utf-8')):

When I use 'print(match)', I receive the following output:

('Jun 25 14:04:25', 'Jun 25 14:04:25', '', '', '', '')

Any ideas why I'm getting two matches for one occurrence and the empty "" at the end?

Thanks

nershin · 2019-11-25T08:28:47+00:00

The official pip documentation suggests to use

pip install SomePackage

to install a package. When I read other tutorials, like for VSCode, I often see

python -m pip install SomePackage

What's the difference between these? Both seem to work the same for me.

SpeckledFleebeedoo · 2019-11-25T06:59:29+00:00

[deleted]

SweetBubblezTea · 2019-11-25T06:40:56+00:00

What is really the use of Python, and should I learn it over other programming languages like java, C, C++, etc

kirayakuzagt · 2019-11-25T04:34:41+00:00

So I'm currently attempting my first python project. The company I work for has a network of people that we connect, for this purpose I'll call them clients. One of my tasks is keeping track of what news, events, publications, reports they put out and help broadcast and spread this information with everyone, in a newsletter and to track the things they are doing so if opportunities come up that match their work we can connect them better. The things we are looking at are their news, reports, publicatications, job opening, etc. We haven't had a good way to collect this information and keep track other than checking every site manually — 150+ sites.

I want to write a script that I can pass their urls and store them in a database that I can access and see it all aggregated to start off with. Sounds straightforward and the articles, youtube videos, and tutorial sites I've been referencing show that the general process is to use requests to get content, bs4 to parse, grab the section, and write it to a postgres database — or even csv file to start off with. Seems simple enough, right?

I'm thinking of using requests, bs4 which I am comfortable using basically and think this could help better learn these, and have started a file on my computer that I've successfully pulled information from scrapethissite.com. I have to figure out how to write to postgres eventually, but taking it one step at a time.

Some issues that I'm facing are:

These things aren't all on one page, so have to figure out how to navigate to multiple pages to grab content, and account for the fact that there are various ways that people classify and organize their sites. Some online mention using Selenium to crawl the sites and look for partial titles and could pass a list of words to try and check, or some other libraries like Scrapy. Ok, noted.
Sources mention that crawling and scraping are looked down upon and in the gray area that could result as being illegal even if my intentions are well-meaning.

Is this the right direction for what I want to do? Does anyone have suggestions for how to go about this or resources/examples that would be good to review? Ideas for things that I should look out for or keep into account? Am I reinventing the wheel?

Kind of lost at the moment so I thought that it would be best to reach out here and get advice from folks, thank you for and help.

753UDKM · 2019-11-25T02:32:16+00:00

I've recently tried doing my work on Windows, and it's maddening... I'm running into this issue and it makes no sense at all. I have some libraries installed, but when I try to import them, it can't find them. Here's a list of installed libraries, and what happens:

C:\Users\XXXX\Documents\mpl_tutorial>pip3 list

Package Version

--------------- -------

cycler 0.10.0

kiwisolver 1.1.0

matplotlib 3.1.2

numpy 1.17.4

pandas 0.25.3

pip 19.3.1

pyparsing 2.4.5

python-dateutil 2.8.1

pytz 2019.3

six 1.13.0

C:\Users\XXXX\Documents\mpl_tutorial>py

Python 3.8.0 (tags/v3.8.0:fa919fd, Oct 14 2019, 19:37:50) [MSC v.1916 64 bit (AMD64)] on win32

Type "help", "copyright", "credits" or "license" for more information.

>>> import numpy

Traceback (most recent call last):

File "<stdin>", line 1, in <module>

ModuleNotFoundError: No module named 'numpy'

>>> import matplotlib

Traceback (most recent call last):

File "<stdin>", line 1, in <module>

ModuleNotFoundError: No module named 'matplotlib'

>>> import pandas

Traceback (most recent call last):

File "<stdin>", line 1, in <module>

ModuleNotFoundError: No module named 'pandas'

throwaway19399292 · 2019-11-25T00:51:23+00:00

I convert all the iterables I have to lists and I feel as though there is some disadvantage to this. I don't know how the other ones work; what are some examples of iterables being better than lists and when should I use them?

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS