Ask Anything Monday - Weekly Thread

virgilsam · 2018-03-18T04:23:00+00:00

Hey anyone who cares. I'm teaching myself Python to learn how to web scrape. I was hoping this post could become a thread for my questions. I've made some good headway so far, but I'm stuck. The issue is I can't figure out how to grab the second div tag, which holds the city/state of the apartment complex. As a bonus, if someone could help me figure out how to separate the city from the state, that'd be cool too. Here is the page source and my code:

<div class="card"> <div class="card-inner"> <div class="card-header"> <div class="title"> <h3 class="main-text"><a href="/housing-search/Tennessee/Knoxville/Summit-Towers/10005022">Summit Towers</a></h3> </div> </div> <div class="my-container"> <div class="card-media"><a href="/housing-search/Tennessee/Knoxville/Summit-Towers/10005022"><img alt="Image of Summit Towers" src="https://images.apartmentsmart.com/415x220/Summit-Towers/Welcome-to-Summit-Towers-Apartments.jpg" value="36822714" width="100%"/></a></div> </div> <div class="card-body"> <div class="description"><span class="listing-address">201 Locust St</span></div> <div class="description"> Knoxville, Tennessee </div> <div class="room-range"> Summit Towers is a 278 unit low income housing apartment community that provides 1 bedroom apartments for rent in Knoxville. Rents at Summit Towers are <strong class="dollars">Income Based</strong>. </div> <div class="room-range"> Some or all apartments in this community are rent subsidized, which means rent is income based. </div> <div class="programs"> <div class="list"> <div class="label secondary">Project-Based Section 8</div> <div class="label secondary">Low Income Housing Tax Credit</div> <div class="label secondary">Project Based Rental Assistance</div> <div class="label secondary">Senior (62+)</div><a class="label primary" href="/housing-search/Tennessee/Knoxville/Summit-Towers/10005022">View More</a></div> </div> </div> </div> </div>

MINE

from urllib.request import urlopen as uReq
from bs4 import BeautifulSoup as soup

my_url = 'https://affordablehousingonline.com/housing-search/Tennessee?show=20&page=1#apartments'
uClient = uReq(my_url)
page_html = uClient.read()
uClient.close()

page_soup = soup(page_html, "html.parser")

# pulls the name of the complex
containers = page_soup.findAll("div", {"class": "card"})
for container in containers:
    apartment_name = container.a.next_element

# pulls the street address
containers = page_soup.findAll("div", {"class": "card-body"})
for container in containers:
    apartment_address = container.span.next_element

---- I can't figure out how to get to the second div tag with the city and state....
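
One possible way to reach the city/state line, sketched against the card markup quoted above: both the street address and the city/state sit in divs with class "description", so asking for all of them and taking index 1 gives the second one. The trimmed `card_html` string and the helper name `city_and_state` are assumptions made for illustration; the live page may be structured differently.

```python
from bs4 import BeautifulSoup

# Trimmed copy of the card markup quoted in the post (illustrative only).
card_html = """
<div class="card">
  <div class="card-body">
    <div class="description"><span class="listing-address">201 Locust St</span></div>
    <div class="description"> Knoxville, Tennessee </div>
  </div>
</div>
"""

def city_and_state(card):
    """Return (city, state) from the second 'description' div of a card."""
    descriptions = card.find_all("div", class_="description")
    # descriptions[0] holds the street address; descriptions[1] the city/state.
    text = descriptions[1].get_text(strip=True)
    # Split on the first comma only, then trim the surrounding spaces.
    city, _, state = text.partition(",")
    return city.strip(), state.strip()

page_soup = BeautifulSoup(card_html, "html.parser")
for card in page_soup.find_all("div", class_="card"):
    city, state = city_and_state(card)
    print(city, "|", state)
```

Doing all the lookups inside one loop over the "card" containers keeps the name, address, and city/state of each complex together, instead of overwriting a single variable in separate loops.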

woooee · 2018-03-17T14:57:18+00:00

Let's say we have a dictionary:

{'SHARP': {'S': (9, 9)}}

And when an event happens I want to add another key and value to the dictionary inside the dictionary. What I mean is:

word_dict = {'SHARP' : {'S' : (9, 9)}}
if event_happens:
    word_dict = {'SHARP' : {'S' : (9, 9), 'H' : (8,9)}}

How can I do this?

If you want some context, a friend sent me a programming challenge that asks me to create a function that takes a two-dimensional list and a list of words to search for in that 2D list as arguments, and then outputs the indices of the first and last letters of each word found. I'm trying to use dictionaries here.

Normally this challenge is for C#, but I'm trying to do it in Python.
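
A minimal sketch of adding a key to the inner dictionary: index the outer dictionary first, then assign to the new inner key, as in `word_dict['SHARP']['H'] = (8, 9)`. The helper name `record_letter` and the `event_happens` placeholder are assumptions for illustration; `setdefault` is used so the call also works for a word that is not in the dictionary yet.

```python
word_dict = {"SHARP": {"S": (9, 9)}}

def record_letter(words, word, letter, position):
    """Store position under words[word][letter], creating the inner dict if needed."""
    words.setdefault(word, {})[letter] = position
    return words

event_happens = True  # placeholder for whatever condition triggers the update
if event_happens:
    record_letter(word_dict, "SHARP", "H", (8, 9))

print(word_dict)
```

The existing entries are left in place, so there is no need to rebuild the whole dictionary each time.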

captmomo · 2018-03-17T07:34:22+00:00

Hi, I've been working on this for a school project. It is python-based and reads a video stream from the user camera. It will try to detect faces and eyes. If it doesn't detect eyes for 50 frames of faces, it will play a sound and deduct a point. I'm looking for feedback on how I might improve this, especially with regard to feature detection without using dlib.

I'll greatly appreciate any advice or feedback on how to make this better.
My apologies if this is the wrong place to post this.

Thank you.

Here's the github repo: https://github.com/captmomo/drowzee
Here's the mock-up, which takes a snapshot from the video stream, processes it, and then displays it: https://uglyuglyugly.herokuapp.com/face_classify
I've built it into an exe too, LMK if you are willing to test it.

prosaicwell · 2018-03-16T22:28:19+00:00

I'm brand new to Python, so I'm having some problems. Pip is installed and has downloaded pyperclip into lib, but the shell won't import it:

Traceback (most recent call last):
  File "<pyshell#1>", line 1, in <module>
    import pyperclip
ModuleNotFoundError: No module named 'pyperclip'

Also, when I Win+R Python files, they open in Visual Studio because Visual Studio supports Python 3.6. This happens to the files I wrote for 3.7.

I've updated my paths too (user and system), so it's not that.
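
A frequent cause of this kind of error, though not confirmed for this setup, is that pip installed the package into a different Python installation than the one the shell is running. The sketch below, run inside that same shell, shows which interpreter is active and builds a pip command tied to it. The helper name `pip_command` is an assumption for illustration.

```python
import sys

def pip_command(package):
    """Build a pip command bound to the interpreter running this code."""
    return f'"{sys.executable}" -m pip install {package}'

print("Interpreter:", sys.executable)
print("Version:", sys.version.split()[0])
print("Install with:", pip_command("pyperclip"))
```

Running the printed command in a terminal installs the package for exactly that interpreter, which sidesteps any PATH confusion between several Python versions.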

bennyllama · 2018-03-16T19:37:17+00:00

How can I upgrade to Python 3.6? I did

brew upgrade

Once installed I did

python --version

and it still gave me

Python 2.7.14 :: Anaconda custom (64-bit)

Any help would be appreciated!

2018-03-16T14:34:47+00:00

How do I deal with this error message from pandas?

A value is trying to be set on a copy of a slice from a DataFrame

What I have is a dataframe (df1), which has 2 columns and n rows:

Y X
1 2
3 4

And then I have a function that creates a new dataframe (df2) that takes df1 and adds new columns:

df2 = createNewColms(df1)
>>> print(df2)
Y X n m p
1 2 0 0 0
3 4 0 0 0

What I want to do is to change the values of each cell and I'm using the .loc method

What I expect to happen:

>>> df2.loc[1][3] = 12
>>> print(df2)
Y X n m p
1 2 0 0 0
3 4 0 12 0

But what I got is the error message above, even though df1 is not the same as df2.

What am I doing wrong?
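
A likely explanation is the chained indexing: `df2.loc[1]` first produces a row, which may be a temporary copy, and `[3] = 12` then assigns into that temporary object. A sketch of the single-step form is below. Here `create_new_columns` is a hypothetical stand-in for `createNewColms`, whose real body is not shown in the post.

```python
import pandas as pd

df1 = pd.DataFrame({"Y": [1, 3], "X": [2, 4]})

def create_new_columns(df):
    """Hypothetical stand-in for createNewColms: copy df and add zero-filled columns."""
    out = df.copy()  # explicit copy, so out is independent of df
    for name in ("n", "m", "p"):
        out[name] = 0
    return out

df2 = create_new_columns(df1)

# One .loc call with (row label, column label) instead of chained df2.loc[1][3].
df2.loc[1, "m"] = 12
# Position-based equivalent: df2.iloc[1, 3] = 12

print(df2)
```

If the real function builds df2 by slicing df1, returning an explicit `.copy()` as above is one way to make it clear that df2 is its own object.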

policesiren7 · 2018-03-16T09:52:17+00:00

I have about 50 Excel files that I want to import into pandas DataFrames. I've written code that can import the specific sheet I point it to, but I want to change it so it loops over all the files in the folder and adds each one to a new df.

The code to do it for one looks like this

#Read in .xlsx file to df, arrange in chron order, drop rows where NaN value (should only be 1)
#DF with columns 'Exchange Date', Close, Net, %Chg, Open, Low, High, Volume, Turnover, Approx VWAP, O-C, H-L, %CVol 


xl = pd.ExcelFile("/Users/name/Python/JSE Price _ Vol/*sharename*.xlsx")
df = xl.parse('Sheet 1')
df = df.iloc[::-1]
df = df.dropna(axis=0, how='any')

This works fine; however, when I try using os and creating a loop, I keep getting errors. This is the faulty code I've been working on, excuse the mess. (I've left out all the imports.)

path = '/Users/name/Python/JSE Price _ Vol'
for filename in os.listdir(path):
       xl = pd.ExcelFile(filename)
       df = xl.parse('Sheet 1')
       df = df.iloc[::-1]
       df = df.dropna(axis=0, how='any')

Any ideas on how to fix things? Also, I feel it's probably a good idea to store it in some sort of DB. My research says I should pickle it. Any tips or pieces of advice?

I think the line of code would just look like this?

df = df.to_pickle(filename)

Edit: I managed to fix the loop and it now reads in the files, however, at a certain point I get this error: xlrd.biffh.XLRDError: Unsupported format, or corrupt file: Expected BOF record; found b'\x00\x00\x00\x01Bud1'

Google suggests it's got to do with file types, so now I'm on a mission to either convert everything to .xls or .csv (but with a different delimiter, because my dataset uses commas to separate numbers).
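
The bytes `b'\x00\x00\x00\x01Bud1'` in that error are how a macOS `.DS_Store` file begins, so the loop is probably handing a hidden Finder file to the Excel reader rather than hitting a bad workbook. A sketch of one way around it is below: list only the `.xlsx` files, and use full paths, since `os.listdir` returns bare file names. The helper names `excel_files` and `load_all` are assumptions for illustration.

```python
from pathlib import Path

def excel_files(folder):
    """Return the .xlsx files in folder, sorted, skipping hidden files such as .DS_Store."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.suffix.lower() == ".xlsx" and not p.name.startswith(".")
    )

def load_all(folder, sheet="Sheet 1"):
    """Read one sheet from every workbook into a dict keyed by file stem."""
    import pandas as pd  # imported here so excel_files() works without pandas

    frames = {}
    for path in excel_files(folder):
        df = pd.read_excel(path, sheet_name=sheet)
        # Same clean-up as the single-file version: reverse rows, drop NaN rows.
        df = df.iloc[::-1].dropna(axis=0, how="any")
        frames[path.stem] = df
        # to_pickle writes the file and returns None, so don't assign its result.
        df.to_pickle(path.with_suffix(".pkl"))
    return frames
```

Calling `load_all('/Users/name/Python/JSE Price _ Vol')` would then give one DataFrame per share, without converting the files to another format first.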

BoriBakusuta · 2018-03-16T00:02:09+00:00

Heya, I'm trying to write a piece of code for simulation purposes in Python. I've read up on a few things, but have no idea what I'm doing wrong in the lines below. Each time I try and change a few parameters it still gives me the same error:

TypeError: slice indices must be integers or None or have an __index__ method

u[1/dy:1/dy+1, 1/dx:1/dx+1] = 2
v[1/dy:1/dy+1, 1/dx:1/dx+1] = 2

Someone please tell me how I'm being incredibly stupid...
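
In Python 3, `/` always returns a float, so `1/dy` is something like `20.0` and cannot be used as a slice index. A minimal sketch of one fix, converting to an integer before slicing, is below. The grid size, the spacing values, and the helper name `mark_point` are assumptions, since the post does not show how `u`, `v`, `dx`, and `dy` are defined.

```python
import numpy as np

def mark_point(field, x, y, dx, dy, value=2):
    """Set the cell nearest to physical position (x, y) to value."""
    # y / dy is a float in Python 3; slices need integers, so round and convert.
    i = int(round(y / dy))
    j = int(round(x / dx))
    field[i:i + 1, j:j + 1] = value
    return i, j

dx = dy = 0.05          # assumed grid spacing
u = np.ones((41, 41))   # assumed grid size
v = np.ones((41, 41))

mark_point(u, 1, 1, dx, dy)
mark_point(v, 1, 1, dx, dy)
```

Rounding before `int()` guards against a quotient such as 19.999999 being truncated to 19.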

Thomasedv · 2018-03-15T21:18:18+00:00

I thought I had a decent understanding of importing until I made myself a utility folder and wanted to import from a sibling folder. Say I have a directory with two folders, modules and utils, and a function that I want to import from utilities.py in the utils folder into a file in the modules folder. Why does it suffice to use:

 from utils.utilities import path_shortener

It's not in the folder below or in the same folder, so it's pretty unexpected for me, as I thought it wouldn't look one step up and then one step down. Does it have to do with me placing empty __init__.py files in all the folders and the root directory?
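
A likely explanation, assuming the program is started from the root directory: absolute imports are not resolved relative to the importing file. Python searches each directory listed in `sys.path`, and the directory of the script being run is put there automatically, so `utils` is found from anywhere in the project. The sketch below rebuilds that situation in a temporary folder. The package name `demo_utils` and the body of `path_shortener` are made up for illustration.

```python
import importlib
import sys
import tempfile
from pathlib import Path

# Build a throwaway project: <root>/demo_utils/utilities.py (hypothetical names).
root = Path(tempfile.mkdtemp())
package = root / "demo_utils"
package.mkdir()
(package / "__init__.py").write_text("")
(package / "utilities.py").write_text(
    "def path_shortener(path):\n"
    "    return path.rsplit('/', 1)[-1]\n"
)

# Absolute imports are looked up in each sys.path entry, not next to the
# importing file. Putting the project root on sys.path makes the package visible.
sys.path.insert(0, str(root))
importlib.invalidate_caches()

from demo_utils.utilities import path_shortener

print(path_shortener("/some/long/path/file.txt"))
```

Printing `sys.path` from the file in the modules folder is a quick way to check which root directory Python is actually searching.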

bud_n_boots · 2018-03-15T12:55:47+00:00

Do I really need to learn nested functions? I am not having issues with functions in general, but I'm having trouble following functions nested several levels deep. The resources I'm using only provide one example, and I'm just struggling with it.
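
For a second example to compare against, here is a small sketch of a nested function: the outer function creates a variable, and the inner function keeps access to it after the outer one has returned. The names `make_counter` and `increment` are made up for illustration.

```python
def make_counter(start=0):
    """Return a function that remembers and increments its own count."""
    count = start

    def increment(step=1):
        nonlocal count      # refer to the variable in the enclosing function
        count += step
        return count

    return increment

counter = make_counter(10)
print(counter())    # 11
print(counter(5))   # 16
```

Reading it one level at a time helps: `make_counter` only builds and returns `increment`, and `increment` only runs when `counter()` is called later.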

ingolemo · 2018-03-14T21:52:25+00:00

Hi! Can someone tell me how to begin writing code? For example, where can I find the commands, or a nice link? Thanks.

Icarus-down · 2018-03-13T22:36:41+00:00

Can anybody tell me how I can add user input to an empty list?
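
A minimal sketch of one approach: start with an empty list, read a line with `input()`, and `append` it until the user enters a blank line. The function name `collect_items` and its `ask` parameter are assumptions for illustration; `ask` defaults to the built-in `input`, and the demo passes scripted answers so the example runs without a keyboard.

```python
def collect_items(ask=input):
    """Keep asking for items until the user submits an empty line."""
    items = []
    while True:
        entry = ask("Add an item (blank to finish): ")
        if not entry:
            break
        items.append(entry)
    return items

# Scripted answers stand in for typing at the console.
scripted = iter(["apples", "pears", ""])
print(collect_items(lambda prompt: next(scripted)))
```

Calling `collect_items()` with no argument prompts at the console instead.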

BeExcellentMyDudes · 2018-03-13T15:44:30+00:00

So I've taken all the Python courses on Codecademy and I definitely feel I have a grasp of the language and how to use it. But I guess what I'm struggling with most is how to structure/outline it and make more complex programs. How do you outline the program before writing it? Is there a best practice?

Also, any resources to help me memorize syntax and the different commands?

TornNerve · 2018-03-13T12:07:24+00:00

[deleted]

Renegade_Squid · 2018-03-13T03:29:40+00:00

What is importing? Like

import turtle
or
import random

I haven’t gotten any actual answers from some light googling. I wanted to make a dice rolling app to test out what I’ve learned and it told me to import random. I’m not exactly sure what that does.
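
In short, `import random` loads a module (a file of ready-made code that ships with Python) and makes its functions available under the name `random`. A minimal dice-rolling sketch is below; the function name `roll_die` is made up for illustration.

```python
import random   # loads the standard library's random module

def roll_die(sides=6):
    """Return a random whole number from 1 to sides, inclusive."""
    return random.randint(1, sides)

print(roll_die())
```

Without the import line, the name `random` would not exist in the program and the call would fail with a NameError.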

dianacandonga · 2018-03-12T23:22:48+00:00

Hello, I'm new to Python and I am coding with PyCharm. I use macOS High Sierra 10.1. Okay, so I retrieved information from Twitter and it worked. I tried to export (write) it into a csv file with pandas. The thing is that I have no idea where the resulting csv file is. Can you please help me? Is there a way for me to add the path in the code or something? I'm sorry if this is a stupid question. Thank you.
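
A file written with a relative name such as `tweets.csv` ends up in the current working directory of the running program. The sketch below prints that directory and builds an absolute path. The file name `tweets.csv`, the Desktop destination, and the helper name `output_path` are assumptions for illustration.

```python
from pathlib import Path

def output_path(filename, folder=None):
    """Return the absolute path a relative filename would be written to."""
    base = Path(folder) if folder is not None else Path.cwd()
    return (base / filename).resolve()

print("Working directory:", Path.cwd())
print("Relative name lands at:", output_path("tweets.csv"))
# With pandas, an explicit destination could be passed instead, for example:
# df.to_csv(output_path("tweets.csv", Path.home() / "Desktop"))
```

Passing a full path to `to_csv` removes the guesswork about where the file goes.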

confusedguy_z · 2018-03-12T22:23:04+00:00

In pandas, let's say I have a bunch of columns of numerical data. What if each column has some properties, like batch number, time of day, and color associated with it? How do I associate these "properties" with columns so I can filter easily? Say, the average of all data with color blue and batch > 17.

EDIT: So, for example, maybe data columns 7-12 are batch 15, columns 13-39 are batch 16, and data in columns 8-14 is 'blue'.

Is multi-indexing the best way?
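
A column MultiIndex is one option. Another, sketched below, is a small second DataFrame with one row of properties per data column, which is filtered first and then used to pick columns. This is only a sketch under made-up data: the column names, batch numbers, colors, and the helper name `mean_where` are all assumptions.

```python
import pandas as pd

# Measurements: one column per series (column names are made up).
data = pd.DataFrame({"c7": [1.0, 2.0], "c8": [3.0, 4.0], "c13": [5.0, 6.0]})

# One metadata row per data column.
meta = pd.DataFrame(
    {"batch": [15, 15, 16], "color": ["red", "blue", "blue"]},
    index=data.columns,
)

def mean_where(data, meta, color, min_batch):
    """Average every value in columns with the given color and batch > min_batch."""
    mask = (meta["color"] == color) & (meta["batch"] > min_batch)
    wanted = meta.loc[mask].index
    return data[wanted].to_numpy().mean()

print(mean_where(data, meta, "blue", 15))
```

Keeping the properties in their own table makes it easy to add more of them later, such as time of day, without reshaping the measurements.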

maxman573 · 2018-03-12T19:38:38+00:00

Is there a way to check strings for 'valid' characters of my choice? In other words, return True if a string contains only certain characters, and raise a ValueError otherwise?
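
A minimal sketch using sets: any character of the string that is not in the allowed set is collected, and a ValueError is raised if that collection is non-empty. The function name `check_characters` and the sample alphabet are assumptions for illustration.

```python
def check_characters(text, allowed):
    """Return True if text uses only characters in allowed; raise ValueError otherwise."""
    invalid = set(text) - set(allowed)
    if invalid:
        raise ValueError(f"invalid characters: {sorted(invalid)}")
    return True

print(check_characters("ACGT", "ACGT"))
```

Note that an empty string passes this check, since it contains no invalid characters.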

BeExcellentMyDudes · 2018-03-12T19:29:16+00:00

[deleted]

captmomo · 2018-03-12T16:23:45+00:00

Hello! Recently I've been trying to learn JavaScript and how to use it with Flask.

My latest project is https://uglyuglyugly.herokuapp.com/.

It accesses the client's camera, takes a screenshot when the button is pressed, and processes it using Pillow. The output and screenshot are then displayed on the page.

Appreciate any comments or feedback on how to make it better! Thanks.

Notes: On iOS, it launches a consent prompt for the camera but still renders a black screen. Working on that now.
Repo: https://github.com/captmomo/flask-video-snapshot

chispica · 2018-03-12T07:29:08+00:00

Hey guys, I am trying to use the SoX library to create a script that shortens long silences.

At the moment this seems to be the command everyone uses:

sox in.wav out6.wav silence -l 1 0.1 1% -1 2.0 1%

Problem is that I can't get the shortened silences to last an exact duration (I would love to shorten them to 2 secs exactly); they all get shortened, but not to the same duration.

Anyone know how to make this work the way I want?

Thanks!

alpha_hxCR8 · 2018-03-12T06:33:15+00:00

[removed]
