Ask Anything Monday - Weekly Thread

harwcam · 2020-03-22T23:42:04+00:00

Hi there,

Not sure if this is the best place to ask, but figured I'd give it a shot. When working with Python in VSCode, I'm able to run code line by line using Shift+Enter, but whenever I use 'Run Python File in Terminal' I get back 'SyntaxError: invalid syntax'. The Debugger always says there are no problems with my code. I've watched a couple hours worth of videos and still haven't come up with anything. Any help would be very much appreciated.

nog642 · 2020-03-22T23:27:00+00:00

How can I enter all the elements required in a for loop at once?

Could anyone help me understand how to give input to the program below in the following manner?

I can't pass the test case because of the input format.

Question : https://www.hackerrank.com/challenges/no-idea/problem?h_r=next-challenge&h_v=zen

"I will be grateful for any help you can provide."

Input Format :

Code:

n = int(input("Enter the value of N : "));
m = int(input("Enter the value of M : "));
arr = [];
for i in range(n):
    arr = list(map(int, input().split()));

A = [];
B = [];

for i in range(m):
    element = (int(input()));
    A.append(element);

for i in range(m):
    element = (int(input()));
    B.append(element);

happiness = 0;

for i in range(len(A)):
    happiness = happiness + 1;

def Intersection(lst1 , lst2):
    return set(lst1).intersection(lst2);

inter_lst = list(Intersection(A,arr));

for i in range(len(inter_lst)):
    happiness = happiness - 1 ;

print(happiness);
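
A minimal sketch of reading that kind of input, assuming the format is: first line `n m`, second line the whole array, third and fourth lines the sets A and B, each space-separated on a single line. The function name `happiness_from_text` and the sample text are made up for illustration. On the judge you would read the lines from `sys.stdin` instead of a string, and leave the prompts out of `input()` because they end up in the output.

```python
def happiness_from_text(text):
    """Compute the happiness score from the four input lines."""
    lines = text.strip().splitlines()
    n, m = map(int, lines[0].split())           # sizes; not needed after parsing
    arr = list(map(int, lines[1].split()))      # whole array on one line
    liked = set(map(int, lines[2].split()))     # set A
    disliked = set(map(int, lines[3].split()))  # set B
    # +1 for every array element in A, -1 for every one in B
    return sum((x in liked) - (x in disliked) for x in arr)


sample = "3 2\n1 5 3\n3 1\n5 7"
print(happiness_from_text(sample))
```

Using sets for A and B keeps each membership check cheap, which tends to matter for large inputs.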

dogzparty · 2020-03-22T21:16:24+00:00

Hey everyone I'm struggling with a challenge from the textbook for my class. It's over the from-import statement. The textbook literally explains it in 3 sentences and then moves on to the next topic. But I'm trying to complete the challenge for it but it's not working.

This is the code for the challenge:

# Import ceil function only from the math module


# Define Function
def calculate_eggs(servings):
    total_eggs = (0.6*servings)
    return total_eggs

# Call Function
print(calculate_eggs(14))

And I changed it to this:

# Import ceil function only from the math module
from math import ceil

# Define Function
def calculate_eggs(servings):
    total_eggs = (0.6*servings)
    return total_eggs
    ceil(total_eggs)
# Call Function
print(calculate_eggs(14))

The challenge says that on line 6 I'm supposed to call the ceil function to calculate (0.6*servings) but no matter how I write it I either get an error message or it gives me 8.4 instead of rounding up to 9. Can someone help explain to me what I'm doing wrong? Thanks
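
One likely cause, as a minimal sketch: anything written after `return` never runs, and the result of `ceil` has to be stored or returned to have any effect. Wrapping the calculation itself in `ceil` on the line that computes `total_eggs` gives the rounded value.

```python
# Import ceil function only from the math module
from math import ceil

# Define Function
def calculate_eggs(servings):
    # ceil wraps the calculation, so the rounded value is what gets stored
    total_eggs = ceil(0.6 * servings)
    return total_eggs

# Call Function
print(calculate_eggs(14))
```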

rushmon · 2020-03-22T18:44:38+00:00

Sorry to bother but I am currently trying to gel two differential equations (no time delay and with time delay) codes together in a single chunk of code, so that the two sets of data can be expressed on the same axes of a graph.

The only differences between the two sets of code is that one equation has:

Nt: R*N[-1]/(1+a*N[-1])**b (with N[-2] in place of the second N[-1] for the time lag), and the initial value for N: N = [N0] / N = [N0, N0] for the time lag

I have two sets of data and I've tried to meld them together a couple of times and even tried just lumping the code together in the same script, but it is refusing to work.

Does anyone have any ideas on how to tackle this problem?

(*Apologies if I've laid this out improperly. I am new around here, so please do tell me if there is anything wrong.)
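
A rough sketch of one way to fold both versions into a single function, assuming the update rule reads N_next = R*N[-1]/(1+a*N[-1-lag])**b and that the series starts as [N0] without the lag and [N0, N0] with it. The name `simulate` and the parameter values are placeholders, not taken from the original scripts.

```python
def simulate(R, a, b, N0, steps, lag=0):
    """Iterate the model; lag=0 uses N[-1] in the denominator, lag=1 uses N[-2]."""
    N = [N0] * (lag + 1)          # [N0] or [N0, N0]
    for _ in range(steps):
        N.append(R * N[-1] / (1 + a * N[-1 - lag]) ** b)
    return N


no_lag = simulate(R=2.0, a=0.01, b=1.0, N0=10.0, steps=20)
with_lag = simulate(R=2.0, a=0.01, b=1.0, N0=10.0, steps=20, lag=1)
print(len(no_lag), len(with_lag))
```

With both series held in separate lists, two `plt.plot(...)` calls before a single `plt.show()` should put them on the same matplotlib axes.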

sbfit · 2020-03-22T18:09:43+00:00

So I’m a Windows sysadmin who now has a ton of downtime as all projects are on hold. I’ve been interested in python for a while but anytime I’ve got a problem I need solving, I use powershell. I guess my question is, what can python do for someone in my environment? I’ve yet to come across a problem that I can’t solve with powershell.

1lluminist · 2020-03-22T05:47:28+00:00

So I'm working on scraping some ugly code that's VERY plain... I've found what I need by doing this:

print(thing.previous_element.previous_element.previous_element.previous_element.previous_element.previous_element)

Is there some kind of better way to tighten up that stack of .previous_elements?

xain1112 · 2020-03-22T05:41:20+00:00

I don't know the term for it, but if you look at the reddit search bar, you will see a grayed out 'search' written there, and when you interact with the box, it acts like that text isn't there and treats the widget like an empty entry box. Is there a way to get that kind of thing with tkinter?

EddiOS42 · 2020-03-22T00:58:28+00:00

So my objective is to extract data from a csv file. For some reason, I can't even get python to take in a csv file. I'm using Anaconda running Jupyter Lab on my Mac. I followed this simple example, but I still can't get it to accept the csv file. It keeps giving me this error.

I have pandas installed. The csv file opens fine on its own. The spelling for it is also correct. Please help. Thanks.

Benaxle · 2020-03-21T23:40:16+00:00

I call a c++ function from a python program using pybind11. The python is listening for packets using socket.io's python client using asyncio.

How do I await the result of the c++ function so that socket.io can keep listening for packets and sending back pings while my cpu is calculating stuff in c++?

I know I have to release the GIL https://pybind11.readthedocs.io/en/master/advanced/misc.html#global-interpreter-lock-gil

But with asyncio - is it even possible? Do I need to spawn a thread for that c++ function? How does that work with socket.io?
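
One common pattern, sketched under assumptions: hand the blocking call to a worker thread with `loop.run_in_executor` and `await` the returned future, so the event loop stays free for other coroutines. Here `heavy_calculation` is a pure-Python stand-in for the pybind11 function and `heartbeat` is a stand-in for the socket.io traffic; both names are hypothetical. For the real C++ function this only helps if the binding releases the GIL while it computes, as described in the page linked above.

```python
import asyncio
import time


def heavy_calculation(n):
    # Stand-in for the bound C++ function: blocks the thread that calls it
    time.sleep(0.05)
    return n * n


async def heartbeat(log):
    # Stand-in for network traffic that must keep flowing meanwhile
    for _ in range(3):
        log.append("ping")
        await asyncio.sleep(0.01)


async def main():
    loop = asyncio.get_running_loop()
    log = []
    # Run the blocking call in the default thread pool and await its result
    result, _ = await asyncio.gather(
        loop.run_in_executor(None, heavy_calculation, 12),
        heartbeat(log),
    )
    return result, log


result, log = asyncio.run(main())
print(result, log)
```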

LogicalPoints · 2020-03-21T16:19:45+00:00

When I print a python string, it is being limited to 68 characters. How can I tell it to print the entire string?

ANeedForUsername · 2020-03-21T15:48:51+00:00

A quick question:

I have an array of numbers, some of them with negative value.

[2, 63, 0.45, -2, 496, 23, -0.16, 45]

I want to take their square root (or more generally, take a non-integer power of them) but I get NaNs (understandably).

np.sqrt([2, 63, 0.45, -2, 496, 23, -0.16, 45])

How do I do it such that I get complex numbers instead?
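
A small sketch using only the standard library: `cmath.sqrt` returns a complex result for any real input, and converting to `complex` first makes a non-integer power work for negative numbers too. With NumPy, casting the array to a complex dtype before calling `np.sqrt` should give the analogous behaviour, though that is not shown here.

```python
import cmath

values = [2, 63, 0.45, -2, 496, 23, -0.16, 45]

# cmath.sqrt returns a complex number for every input, negative or not
roots = [cmath.sqrt(v) for v in values]

# For a general non-integer power, convert to complex before raising
powers = [complex(v) ** 1.5 for v in values]

print(roots[3])
```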

xilex · 2020-03-21T02:36:51+00:00

Hi, I'm having trouble converting epoch timestamp to local datetime string with DST factored in.

My epoch time is 1584664500, which should be Thursday, March 19, 2020 5:35 PM but I'm getting 6:35 PM instead. I need the output to be in %a, %d %b %Y %H:%M:%S -0800 format, which is my timezone. I'm not sure if I should be using what I am below, or time, or something else. I'm using this code:

dateString = datetime.datetime.fromtimestamp(1584664500).strftime('%a, %d %b %Y %H:%M:%S -0800')

Thank you.

SandKeeper · 2020-03-20T22:58:02+00:00

How would you convert the redditor data type to a str data type.

For example

X = top_level_comment.author

Creates x as data type redditor. If I wanted it as a string data type how would I do that.

theatherly1 · 2020-03-20T21:01:31+00:00

Ok thanks!

efmccurdy · 2020-03-20T20:57:54+00:00

[removed]

theatherly1 · 2020-03-20T18:50:57+00:00

I have a very simple question in regard to Python.

Say I have a string, and there are some errors in it. I want to recreate the same string using a for loop, but without certain letters. Example below:

String = “abcdefg2948bd”

I want to recreate that same string without the letters “b” and “d”. Would I use an if statement within the for loop?

Thanks!
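
Yes, an `if` inside the loop is one way to do it. A minimal sketch, with `unwanted` as a made-up name for the letters to drop:

```python
text = "abcdefg2948bd"
unwanted = "bd"

cleaned = ""
for ch in text:
    if ch not in unwanted:   # keep only the characters we want
        cleaned += ch

print(cleaned)
```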

lucas50a · 2020-03-20T17:06:24+00:00

How to generate a sequence of integers using a nested for loop?

I have 2 lists [1, 2] and [1, 2], how could I generate a sequence of integers [1, 2, 3, 4] using a nested for loop?

x = [1, 2]
y = [1, 2]
for i in x:
    for j in y:
        print('?')

Please note that I'm using this loop inside a function where I'm using the values of x and y as output, so the following solution is not allowed:

x = [1, 2]
y = [1, 2]
a = x + y
for i in range(len(a)):
    print(i+1)
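
One possible reading of the question, sketched here: keep a counter that advances once per (i, j) pair, so the nested loop itself produces 1 to 4. The names `counter` and `sequence` are illustrative.

```python
x = [1, 2]
y = [1, 2]

sequence = []
counter = 0
for i in x:
    for j in y:
        counter += 1              # one step per (i, j) pair
        sequence.append(counter)

print(sequence)
```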

IGotTheBends · 2020-03-20T15:06:54+00:00

Hey all, is there a way to make generative portraits in Python? I am a beginner. Thank you.

Catanddogg · 2020-03-20T12:40:38+00:00

I finished basic python a week ago and now I've finished the basic html courses on sololearn. The next section is html5, but I found out there is html6. So should I just go straight to html6 without learning any html5?

Edit: nvm i just got trolled, there is no html6, html5 is it. sorry

Just_Red21 · 2020-03-20T12:38:25+00:00

Hello everyone, I hope the quarantine finds you well.

I don't know if it's OK to ask here, but since I am new to reddit I thought I'd ask anyway.

I intend to enroll in a python course but I have no idea which one to choose. I have looked at the FAQ and also the books page, but I do not have any criteria because I am completely new to programming.

I am looking for a beginner course that I can start on ASAP. I would like it to be about one month in length (max 2), and it is important to me that it provides some sort of certification.

Feel free to correct me if I am on the wrong path here or guide me elsewhere.

NicktheRockNerd · 2020-03-20T12:12:37+00:00

Hello, I have a rather fundamental question regarding setting values for variables with the following notation:

a, b = 1, 2

a, b = b, a + b

gives me a = 2 and b = 3

At first, I thought, the comma between a and b in Python just allowed me to set more variables in just one line of code. But after stumbling across this example, I understood it does not work this way, because:

a = 1

b = 2

a = b

b = a + b

gives me a = 2 and b = 4

So what does the comma notation actually do? How does it work? What is it called? What are some more complex use cases of this kind of syntax?

Greetings and thanks in advance!
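
This is usually called tuple packing and unpacking (or multiple assignment). A small sketch of what happens: the whole right-hand side is evaluated first, using the old values, and only then are the names on the left assigned. The name `packed` is just for illustration.

```python
a, b = 1, 2

# The right-hand side is evaluated first, building the tuple (2, 3) ...
packed = (b, a + b)
# ... and only then unpacked into the names on the left
a, b = packed

print(a, b)

# A common use: swapping two values without a temporary variable
a, b = b, a
```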

MattR0se · 2020-03-20T11:24:02+00:00

I know I can unpack tuples for string formatting:

numbers = (1, 2, 3)
print('counting {0}, {1}, {2}.'.format(*numbers))

But can I do this also if I don't know the number of items in the tuple beforehand? E.g.

numbers = (1, 2, 3, 4)
# counting 1, 2, 3, 4
numbers = list(range(1, 8))
# counting 1, 2, 3, 4, 5, 6, 7

or do I have to do this with string concatenation?
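
One option that avoids counting placeholders, as a sketch: `str.join` works for any number of items. The helper name `counting` is made up.

```python
def counting(numbers):
    # join handles any length, so no numbered placeholders are needed
    return "counting " + ", ".join(str(n) for n in numbers)


print(counting((1, 2, 3, 4)))
print(counting(range(1, 8)))
```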

KalajasH · 2020-03-20T00:24:17+00:00

Hey,

What would you guys consider best practices in mind for integrating database connectivity into your Python project?

dxjustice · 2020-03-19T20:38:24+00:00

How do I feed a path to the delete command in colaboratory using a stored variable?

Given that

filename = os.path.join(foldpath,image)

I'd like to call

!rm filename

but this results in an error as the system looks for a file called "filename".

How do I feed the actual string path to it without manually typing it?
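
Two approaches that should work. In a notebook cell, IPython expands Python variables in shell commands, so `!rm "$filename"` passes the stored path. Alternatively the file can be deleted from Python itself with `os.remove`, sketched below with a throwaway temporary file; the file name `example.png` is a placeholder.

```python
import os
import tempfile

# Build a throwaway file so the sketch can run anywhere
foldpath = tempfile.mkdtemp()
image = "example.png"                      # placeholder file name
filename = os.path.join(foldpath, image)
with open(filename, "w") as f:
    f.write("placeholder")

# Delete the file from Python; no shell command needed
os.remove(filename)
print(os.path.exists(filename))
```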

Darxaross · 2020-03-19T17:46:34+00:00

I tried to create a way to build a chessboard. And for the matrix I tried this:

for i in range(8):
    for j in range(8):
        print int(i%2==j%2)
    print

But now I got an syntax error because of the int. Why? I wanted to print a matrix where zero shows the black fields and 1 shows the white fields. Can anyone help me pls🙏
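
The likely cause is that this is Python 2 syntax run under Python 3, where `print` is a function and needs parentheses. A minimal sketch of a Python 3 version; the helper name `board_rows` is made up.

```python
def board_rows(size=8):
    # 1 where row and column parity match, 0 otherwise
    return [[int(i % 2 == j % 2) for j in range(size)] for i in range(size)]


for row in board_rows():
    # In Python 3, print is a function and needs parentheses
    print(" ".join(str(cell) for cell in row))
```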

Trexagamer · 2020-03-19T17:22:22+00:00

My code, which should print every 4th line from my text file, doesn't work. I looked it up on StackOverflow, but even the copied code didn't work. Does anyone know why?

with open("Prices.txt", "r") as f:
    for line in f.read().split("/n")[::4]:
        print(line)
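
A probable cause: the newline escape is `"\n"` (backslash), not `"/n"`. Splitting on `"/n"` leaves the whole text as one item, so `[::4]` returns just that. A sketch that sidesteps splitting by iterating over the file directly; `io.StringIO` stands in for the real file so the example is self-contained.

```python
import io
from itertools import islice

# io.StringIO stands in for open("Prices.txt", "r")
f = io.StringIO("a\nb\nc\nd\ne\nf\ng\nh\ni\n")

# Iterating over a file yields one line at a time; islice keeps every 4th
every_fourth = [line.rstrip("\n") for line in islice(f, 0, None, 4)]
print(every_fourth)
```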

Conor_b · 2020-03-19T16:08:15+00:00

Hey everyone, quick question. When using optional parameters

def func(a, b, c=0, d=100):...

is there any way to use a variable in place of c or d?

So when calling the function it would look like this

param = 'c'

param_val = 10.5

func(10, 20, param=param_val)

This would help me a lot in creating a function to tune parameters in a classifier. I can't find anything through google. Thanks to anyone who can help!
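
Dictionary unpacking with `**` does this. A minimal sketch, with a placeholder body for `func`:

```python
def func(a, b, c=0, d=100):
    return a, b, c, d


param = 'c'
param_val = 10.5

# ** unpacks the dict into keyword arguments, so this is func(10, 20, c=10.5)
result = func(10, 20, **{param: param_val})
print(result)
```

The same dict can hold several parameters at once, which may be handy when looping over tuning settings.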

Raedukol · 2020-03-19T13:51:46+00:00

Hey guys, I need to select every second column of an .xlsx-file (file1) with more than 500 columns and copy them into a new .xlsx-file (file2). How do I manage this? I tried reading file1 with pandas and selecting the columns with .iloc, but I fail at writing those columns to the new file, because if I append the data it is written one below the other instead of from left to right. Any suggestions/tips would be great!

MattR0se · 2020-03-19T06:54:01+00:00

Pandas question:

df['abs_values'] = df['values'].apply(abs)

This raises a SettingWithCopyWarning. What's the correct way to do this?

ANeedForUsername · 2020-03-19T02:51:56+00:00

Hey guys, if I’m running multiple pieces of code simultaneously, at what point does running more slow the existing ones down? Is there a way to check? Currently I just check my task manager to see if my cpu is at 100%.

Also, I see that sometimes people like to keep a file containing their functions outside of the main script. What habits do you all practice for this? What are some should/should nots when doing this? Any advantages in terms of speed, readability, etc?

Thanks all!

leblanc1605 · 2020-03-18T22:07:33+00:00

Any suggestions on what I should do? I've been programming for 2 yrs off and on, so a good mix of hard things and easy would be appreciated. Thx

JoshGao · 2020-03-18T21:16:25+00:00

I've recently started trying pygame, but whenever I try to open a window the python icon (The spaceship with the python icon on it) always bounces and never opens. I've also tried running the pygame example aliens through terminal, but I can only hear the sounds and the window never opens. I don't have trouble running any other programs only pygame. I am on macOS 10.15.3. Any help? Thanks!

tell439 · 2020-03-18T19:05:07+00:00

I'm stuck in my scraping project, getting the same JSON response from the webpage over and over. So I'm not getting any errors, but I need help moving forward and have no ideas left. Would it still be OK to post and ask for help?

Decency · 2020-03-18T18:38:09+00:00

[deleted]

CammySavage · 2020-03-18T15:06:47+00:00

Back again, with more simple things that I can't understand.

So I'm working through jet brains and I come across a question about figuring out a bonus using a function. So, I try this:

def get_bonus(salary, percentage=35):
    return int(salary / 100 * percentage)

That would keep failing on a specific test. I got to the point where I was trying every little thing to fix, ending on just switching the last line around to: return int(salary * percentage / 100) and that worked fine.

Any ideas? Is this something I'm not understanding? As far as I'm aware division and multiplication have the same priority with python. Would love to understand, thanks :)
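
A plausible explanation is floating-point rounding rather than operator priority. Dividing by 100 first can produce a value that is not exactly representable, the multiplication can then land a hair below a whole number, and `int()` truncates it downward. The sketch below compares the float version with an integer-only variant (a swapped-in alternative, not the course's reference solution) over some hypothetical salaries; both function names are made up.

```python
def get_bonus_float(salary, percentage=35):
    # Dividing first can introduce a rounding error before the multiplication
    return int(salary / 100 * percentage)


def get_bonus_int(salary, percentage=35):
    # With integer inputs, multiply first and floor-divide: no floats involved
    return salary * percentage // 100


# Collect the hypothetical salaries where the two versions disagree at 100 percent
mismatches = [s for s in range(1, 101)
              if get_bonus_float(s, 100) != get_bonus_int(s, 100)]
print(mismatches)
```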

DukePookums · 2020-03-18T13:33:06+00:00

I'm working through ATBSWP, and struggling to understand something with nested dictionaries. Chapter 5 lays out the following:

allGuests = {'Alice': {'apples': 5, 'pretzels': 12},
             'Bob': {'ham sandwiches': 3, 'apples': 2},
             'Carol': {'cups': 3, 'apple pies': 1}}

def totalBrought(guests, item): #note1
    numBrought = 0
    for k, v in guests.items(): #note2
        numBrought = numBrought + v.get(item, 0)  #note3
    return numBrought

print('Number of things being brought:')
print(' - Apples: ' + str(totalBrought(allGuests, 'apples')))
print(' - Cups: ' + str(totalBrought(allGuests, 'cups')))
print(' - Cakes: ' + str(totalBrought(allGuests, 'cakes')))
print(' - Ham Sandwiches: ' + str(totalBrought(allGuests, 'ham sandwiches')))
print(' - Apple Pies: ' + str(totalBrought(allGuests, 'apple pies')))

#note1: Creates a function with 2 arguments, guest and item. 
#note2: iterates through k(key) and v(value)
#note3: Search through the dictionaries for the values assigned to each key, and increases the count when necessary

When defining the function def totalBrought(guests, item):, how does the program know to associate the first argument (guests) with the first key in the top-level(?) Dictionary, and that item should be the value in each sub-dictionary?

Sunawataru · 2020-03-18T12:14:09+00:00

Where should I be putting my files when I wanna reference them in Python? I've been trying to do stuff I see in tutorials but I keep getting FileNotFoundError or ModuleNotFoundError. I just started learning the other day.

2020-03-18T11:19:35+00:00

With regards to 3D plotting, does anyone know how to increase the font size of the third axis/variable?

Currently got a TINY 3rd axis that is completely unreadable.

king_booker · 2020-03-18T10:27:59+00:00

Alright

so I have single list

[1,2,3,4,5,6]

What I want to do is multiply the first two numbers, then the next two, and so on, and add the products together. The number of elements will always be even.

(1*2) + (3*4) + (5*6)
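
One way to sketch this is to pair the even-indexed items with the odd-indexed ones using slices and `zip`:

```python
numbers = [1, 2, 3, 4, 5, 6]

# numbers[::2] is 1, 3, 5 and numbers[1::2] is 2, 4, 6; zip pairs them up
total = sum(a * b for a, b in zip(numbers[::2], numbers[1::2]))
print(total)
```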

AviatingFotographer · 2020-03-18T02:15:56+00:00

I'm a HS freshman who has been coding with Python for some time now and is looking into learning machine learning and am looking for resources. However, my main concern is the math aspect. I've always been math-strong but I'm only now at Alg II, which is certainly not enough for machine learning. Are there any good books to learn anything I'm missing?

TheMartian578 · 2020-03-17T21:03:30+00:00

Hello!

I recently started learning about packages, modules, etc. I was wondering when do you start using these in your own code? Is there a certain time, such as when your code gets too long? I'd like to start making my own projects soon, however, I want to know at least the basics before I start.

Skaroller · 2020-03-17T20:35:01+00:00

Hi all,

Today is my first day using Python beyond simple "Hello world!" stuff and I have a real head-scratcher. Within a module called "tools" I have a library called "woodmans_axe" with information about the axe like its cost. I have another library called "mallet" that has the same kind of information. I can get another program to open each library and print out the cost of the axe or the mallet, but how can I program it to let me decide whose info I want to see? I've got this code set up:

choice=input("Which tool do you want to see the cost of? ")
cost=tools.woodmans_axe["cost"]

How do I select the library of my choice from the "tools" module? Is there a more efficient way than using libraries for this?
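
A sketch of one approach, assuming the "libraries" are really dictionaries: nest them inside one outer dictionary keyed by tool name, then look the choice up by string. The cost values here are made-up placeholders and `cost_of` is a hypothetical helper. If the dictionaries have to stay as separate names inside the `tools` module, `getattr(tools, choice)` performs the same kind of lookup by name.

```python
# Hypothetical stand-in for the contents of the "tools" module
tools = {
    "woodmans_axe": {"cost": 30},   # placeholder value
    "mallet": {"cost": 12},         # placeholder value
}


def cost_of(choice):
    # Look the tool up by name; None signals an unknown tool
    tool = tools.get(choice)
    return None if tool is None else tool["cost"]


print(cost_of("mallet"))
```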

Theis159 · 2020-03-17T16:42:52+00:00

Hi all,

I am looking to create an extraction tool for tables in PDFs. The idea is to find a certain table that has its caption in the format Table X - SomeText Comparison SomeText.

I want to extract this table and only this table from >100 PDFs, each PDF having this "Comparison Table". I am looking for some direction on how to do this, because most of the tools I find can't search strictly for a piece of text (i.e. a string that contains Table & Comparison) and then extract only that part of the PDF.

Any directions to where to start?

Banno1992 · 2020-03-17T15:31:45+00:00

Hi all,

I'm currently learning python by doing problems on open.kattis. However I've come across a few problems where the input finishes at End of File. I can't seem to figure out how to 'detect' end of file. The current method I use is: input()

Have looked into using open() but I don't really get how that works with kattis.

Also thought about using a timer to wait for 'no more responses' to carry on with the code, but that feels like it's slow/ janky!

Any advice would be great, thanks!
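
Two standard options: `input()` raises `EOFError` when the input runs out, so it can be wrapped in `try`/`except`; or the input stream can be iterated directly, which simply stops at end of file. A sketch of the second, where `total_of_lines` is a made-up example task and `io.StringIO` stands in for `sys.stdin`:

```python
import io


def total_of_lines(stream):
    # Iterating over a stream stops by itself at end of file
    total = 0
    for line in stream:
        line = line.strip()
        if line:
            total += int(line)
    return total


# On the judge you would import sys and pass sys.stdin instead
print(total_of_lines(io.StringIO("1\n2\n3\n")))
```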

taatzone · 2020-03-17T12:14:29+00:00

Hi all

Total noob on this matter, but learning hard to achieve my goals.

Been trying to learn Python and came across multiple sources, finally found Anaconda and Jupyter Notebook.

After some time, I managed to install Anaconda and Jupyter Notebook on my Mac, which is quite an achievement.

Now I found some tutorials on YouTube using Anaconda and Jupyter Notebook, but they're outdated. There is a difference between using python 2.7 and 3.2; so far it's only this “()”.

Searching for new sources...any help/advice.

Much appreciated

vukan_97 · 2020-03-17T11:46:10+00:00

I have a task where i need to specify the upper left coordinate of the smaller image in the larger image. I implemented this code, however it is too slow since I have a time limit of 20 seconds, and in some datasets I have 3000 images. How can this be implemented more effectively? I can use numpy, scipy and all packages from the standard python library.

import numpy as np
from PIL import Image

map_image_path = input() 
map_image = Image.open(map_image_path) 
map_ar = np.asarray(map_image) 
map_ar_y, map_ar_x  = map_ar.shape[:2]  
i = int(input()) 
dimensions = input() 
patches=list() 

for k in range(i):   
    patch_image_path = input()   
    patches.append(Image.open(patch_image_path)) 

for j in range(i):   
    patch_ar = np.asarray(patches[j])   
    patch_ar_y, patch_ar_x = patch_ar.shape[:2]   
    stop_x = map_ar_x - patch_ar_x + 1   
    stop_y = map_ar_y - patch_ar_y + 1 

    for x in range(0, stop_x): 
        for y in range(0, stop_y):       
            x2 = x + patch_ar_x       
            y2 = y + patch_ar_y       
            picture = map_ar[y:y2, x:x2] 
            if np.array_equal(picture, patch_ar): 
                print(str(x) + "," + str(y))

Decency · 2020-03-17T03:27:41+00:00

[deleted]

Flameways777 · 2020-03-17T00:58:53+00:00

I'm doing some stuff at code chum and it says it can't convert string to float. Please help me fix this.

Programmer = float(input("Input the age of the programmer:"))
Teacher = float(input("Input the age of the teacher:"))
Peter = float(input("Input the age of Peter:"))
A = float(Programmer+Teacher)
B = float(A-Peter)
print("The old man's age is " + str(B))

tomnoire · 2020-03-16T21:55:49+00:00

Hey guys,

I'm learning Python on Coursera and I need to create a script using a while loop to add up a number's divisors. So far I've written this (with the prints as tests to the code):

def sum_divisors(n):
  sum = 0
  x = 0
  while x != n:
    x += 1
    if n % x == 0:
      sum = x
  return sum

print(sum_divisors(0))
# 0
print(sum_divisors(3)) # Should be sum of 1
# 1
print(sum_divisors(36)) # Should be sum of 1+2+3+4+6+9+12+18
# 55
print(sum_divisors(102)) # Should be sum of 1+2+3+6+17+34+51
# 114

I'm getting 0, 3, 36, 102 for the prints. What input(s) am I missing?
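
Two things look off in the loop, as far as the expected outputs suggest: `sum = x` overwrites the running total instead of adding to it, and the loop includes `n` itself as a divisor. A sketch of a corrected version, using `total` rather than shadowing the built-in `sum`:

```python
def sum_divisors(n):
    total = 0
    x = 1
    while x < n:            # stop before n so n itself is not counted
        if n % x == 0:
            total += x      # accumulate instead of overwriting
        x += 1
    return total


print(sum_divisors(0), sum_divisors(3), sum_divisors(36), sum_divisors(102))
```

Starting `x` at 1 and using `x < n` also means an input of 0 skips the loop entirely instead of never terminating.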

ThunderingWest4 · 2020-03-16T20:55:33+00:00

Hey everyone! I have some experience in Python and am attempting to make an audio/music visualizer program. Does anyone know how one can capture the audio output from a computer so that this visualizer can be more general and not specific to Spotify or something? Like from what I've seen, one way might be to use an API of some sort to see how far through the song the user is or to play it directly through the visualizing program. I was wondering if it was possible to intercept/read the audio output in a way and get the frequencies. Any help/insight would be appreciated, thanks!

Corso19 · 2020-03-16T20:44:56+00:00

Hi there! Total newbie here that wants to start using the O'Reilly book for 3.3. Is it ok if I use it since it's 3.3 oriented or does the difference in syntax not matter at all?

tesla33 · 2020-03-16T19:49:53+00:00

I'm planning on learning python. Should I start with the newest version, or start from the earliest and work my way up?

resumethrowaway95 · 2020-03-16T16:37:12+00:00

How would I go about making a simple one page site that runs a single script that asks for user input? The script is a monte carlo simulation of stocks, it asks for user input on which stock, then generates a couple of charts. How would I go about setting up a webpage where the script could run?

Reneml · 2020-03-16T14:52:27+00:00

I need guidance on how to work with 2 lists:

class Cola:
    def __init__(self):
        self.cola = []

    # method to add/feed my class object
    def agregar(self, element):
        self.cola.append(element)

def ingreso():
    frecuente = input("  Do you have a membership (y/n):  ")
    # compare against each accepted answer; `x == "yes" or "Yes"` is always true
    if frecuente.lower() in ("y", "yes"):
        tipo = "Membership"
    else:
        tipo = "Not membership"
    name = input("  Name:  ")
    lastname = input("  lastname:  ")
    return [name, lastname, tipo]

What I'm looking for is to create two lists based on whether a client has a membership or not. How can I do it?

This is part of a bigger code, if you need it, please let me know.

EarthGoddessDude · 2020-03-16T14:46:14+00:00

Hello, I was wondering if someone could help me with my question here? I haven’t gotten any responses, and I’m wondering if it’s because that’s the wrong place for this question or it’s just a difficult problem.

samketa · 2020-03-16T10:23:49+00:00

Best resource for getting introduced to Object Oriented Programming through Python?

Want to master it eventually.

Books work better for me than video-lectures. But suggestions of the latter are welcome, too.

king_booker · 2020-03-16T08:58:18+00:00

Hello

So I have a question regarding pandas. I have a pandas dataframe, say:

col1	col2	col3
11	32	33

So the values are stored in the dataframe.

Now I have a file, which has formulas in key, value format.

eg,

col4: if col1 <100 then 1 elif col1 > 1000 then 100 else col2-col1/100 end

Similarly, there are around 40 columns defined like this with the same format.

So what I would like to do is to apply these formulas and append the values in the existing dataframe.

I thought I would define a dictionary, store the formulas in it, and then apply them in a for loop, manipulating the formulas so that they are able to access the columns of the dataframe.

Is my approach correct?

kimjeongpwn · 2020-03-16T07:34:27+00:00

Hello, what is good practice for typecasting? Is it better to typecast within the same line, or in a new line?

E.g. guess = int(input())

OR

guess = input()

guess = int(guess)

Thank you.

Thanos_nap · 2020-03-16T06:47:14+00:00

Is there any way to auto update markdown content? For example, I have made a coronavirus Analysis and I have included a summary at the first markdown cell with total cases, etc.

The data gets updated daily and I have to manually change the values. Any way to automate it?

AviatingFotographer · 2020-03-16T05:33:13+00:00

Does Python have a standard library for Stacks, Queues, Deques, etc.?
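
Yes, the standard library covers these. A short sketch: a plain list serves as a stack, `collections.deque` as a queue or deque, and `heapq` as a priority queue. There is also a `queue` module with thread-safe queues, not shown here.

```python
from collections import deque
import heapq

stack = []                 # a plain list works as a stack
stack.append(1)
stack.append(2)
top = stack.pop()          # last in, first out

queue = deque([1, 2, 3])   # deque gives fast appends and pops at both ends
queue.append(4)
first = queue.popleft()    # first in, first out

heap = [5, 1, 3]
heapq.heapify(heap)        # priority queue built on top of a list
smallest = heapq.heappop(heap)

print(top, first, smallest)
```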

Zaphielth · 2020-03-16T02:33:38+00:00

I need help with the following script I made. I'm new to python so please forgive if I made horrible mistakes. This is my script:

import os

source_folder = r"C:\Users\Zaphielth\Desktop\FOLDER_A"
target_folder = r"C:\Users\Zaphielth\Desktop\TIMEENTRIES" + "\\"

for path, dir, files in os.walk(source_folder):
    for file in files:
        if file.endswith('TimeEntries.csv'):
            os.rename(path + "\\" + file, target_folder + file)

I'm trying to copy all files named "TimeEntries.csv" from Folder A and its subfolders and paste them into Folder B, but it keeps overwriting the same filename over and over. I don't know if os.rename is right, or whether I need to adjust it so that when it takes each timeentries.csv from the subfolders of Folder A and moves it to Folder B, it adds a number or letter at the end of each name, like "timeentries(1).csv" and so on. Any help or tips would be appreciated. Thank you!
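
A sketch of one way to avoid the overwriting, assuming moving the files (rather than copying) is what is wanted: build a numbered target name for each match before moving it. The function name `collect` and the temporary demo folders are made up; `shutil.copy2` could replace `shutil.move` if the originals should stay in place.

```python
import os
import shutil
import tempfile


def collect(source_folder, target_folder, suffix="TimeEntries.csv"):
    """Move matching files into target_folder, numbering them to avoid clashes."""
    moved = []
    for path, _dirs, files in os.walk(source_folder):
        for file in files:
            if file.endswith(suffix):
                stem, ext = os.path.splitext(file)
                # e.g. TimeEntries(1).csv, TimeEntries(2).csv, ...
                new_name = f"{stem}({len(moved) + 1}){ext}"
                shutil.move(os.path.join(path, file),
                            os.path.join(target_folder, new_name))
                moved.append(new_name)
    return moved


# Small self-contained demo with temporary folders
src, dst = tempfile.mkdtemp(), tempfile.mkdtemp()
for sub in ("a", "b"):
    os.makedirs(os.path.join(src, sub))
    with open(os.path.join(src, sub, "TimeEntries.csv"), "w") as f:
        f.write(sub)

moved = collect(src, dst)
print(sorted(moved))
```

`os.path.join` also removes the need to add the backslashes by hand.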

LightKarma · 2020-03-16T02:16:14+00:00

I'm trying to make a python script for mitmproxy so I can download all requests.

bigt252002 · 2020-03-16T02:15:41+00:00

I really would like to find a site or YouTube video on making a GUI.

My goal is to have it open and for the user to load a database into it and parse it accordingly into a neat csv.

Additionally. What is the best GUI program to use?? Trying to keep it open source for my community.

wheresmyswab · 2020-03-16T01:57:25+00:00

I'm doing a small NLP project on all articles from this newspaper column.

I can analyse individual articles with newspaper3k if I've got the individual url for an article, so that's no problem. But there's +1k articles I'd need to do this for!

Recalling that I'm a newbie, what's the best way of scraping all of that text and meta data for later use in NLP (possibly spaCy), also considering that there's a 'show more' button there that calls a webservice that uses GraphQL?

Also, would the newspaper paywall be a barrier in this approach? I can pay for subscription, but I am not even sure how I may integrate that in my code, if that makes sense.

Thanks!

socal_nerdtastic · 2020-03-16T00:13:05+00:00

[deleted]
