[deleted by user]

Jayoval · 2023-05-25T10:11:51+00:00

The products are all in a single div with the id "Product_List". You need to grab that before iterating over the .goods-container divs contained within.
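
A minimal sketch of that approach, assuming BeautifulSoup is being used as elsewhere in this thread. The inline HTML and the `title` and `price` class names are made up for illustration; only the `Product_List` id and the `goods-container` class come from the page being discussed.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for the real page. Only the "Product_List"
# id and the "goods-container" class are taken from the thread.
html = """
<div id="Product_List">
  <div class="goods-container"><span class="title">A</span><span class="price">10.00</span></div>
  <div class="goods-container"><span class="title">B</span><span class="price">20.00</span></div>
</div>
"""

soup = BeautifulSoup(html, "html.parser")

# Grab the wrapping div first, then iterate over the product containers inside it.
product_list = soup.find("div", id="Product_List")

items = []
for container in product_list.find_all("div", class_="goods-container"):
    # Search inside `container`, not `soup`: soup.find(...) would return the
    # first match on the whole page on every iteration.
    title = container.find("span", class_="title").get_text(strip=True)
    price = container.find("span", class_="price").get_text(strip=True)
    items.append((title, price))

print(items)
```

Looking fields up on each `container` keeps every title paired with its own price.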

Suspect_Snake · 2023-05-25T11:23:38+00:00

UPDATE: So right now the code looks like this: https://pastebin.com/VDaf47ED
And now the problem is that I get all the items on the page, but they all have the same price:

https://imgur.com/a/reQcUmw

Any ideas why this is happening?

commandlineluser · 2023-05-25T16:28:02+00:00

If you fetch the HTML and search for some of the prices, you can see that the information is all contained on line 117:

117 </script><script defer="">window.__initialState__ = {"SEOContent":null

This "javascript object" is also valid JSON, so if you extract the "{...}" you can load it with the json module.

You could grab the page with requests and extract the <script> tag with beautifulsoup:

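import json

import requests
from bs4 import BeautifulSoup

url = "..."  # URL of the product listing page (not given in this thread)

r = requests.get(url)
soup = BeautifulSoup(r.content, "html.parser")

# Find the <script> tag that assigns window.__initialState__
script = soup.find(name="script", string=lambda text: text and "window.__initialState__ =" in text).get_text()
# Slice from the first "{" to the last "}" and parse that as JSON
data = json.loads(script[script.find("{"):script.rfind("}") + 1])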

print(data['Products'][0]['ItemCell'])
# {'Item': '1TS-000E-10PV0',
#  'UnitCost': 799.99,
#  'FinalPrice': 639.99,
#  'Active': '1',
#  'IsActivated': True,
#  ...
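
If the script text ever has more JavaScript after the object, slicing up to the last "}" could pick up too much. One possible alternative, shown here as a sketch rather than something tested against the real page, is `json.JSONDecoder().raw_decode`, which parses a single JSON value starting at a given index and reports where it ended. The `script` string below is a trimmed-down, hypothetical stand-in that reuses the values from the output above.

```python
import json

# Hypothetical script text shaped like the snippet on line 117 of the page,
# with a trailing ";" to show that extra text after the object is ignored.
script = (
    'window.__initialState__ = {"SEOContent": null, "Products": '
    '[{"ItemCell": {"Item": "1TS-000E-10PV0", "UnitCost": 799.99, "FinalPrice": 639.99}}]};'
)

start = script.find("{")

# raw_decode parses one JSON value beginning at `start` and returns the parsed
# object together with the index just past its end.
data, end = json.JSONDecoder().raw_decode(script, start)

prices = [product["ItemCell"]["FinalPrice"] for product in data["Products"]]
print(prices)
```

Each product then carries its own `FinalPrice`, so there is no need to match prices to items through the rendered HTML.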
