Cookie Thread Act 6: Cookie & Thread

Arete · September 12, 2024, 5:25am

this is what I got when I ran it

Arete · September 12, 2024, 5:25am

so I think it’s not finding the threads at all

notblackorwhite · September 12, 2024, 5:28am

It almost certainly just spat out code nearly verbatim from its data set. Someone tried to show me how good ChatGPT was at writing boilerplate code and it spit out a partial, non-working example from the library’s docs that was the literal first link when you googled the question. Like… Yeah it did manage to grab the code marginally faster, but it literally wouldn’t run and just straight up copied from the docs lol.

But yeah, point is the odds of a non-working sample are pretty good

notblackorwhite · September 12, 2024, 5:28am

Right, your list is length zero.

Arete · September 12, 2024, 5:29am

technically mine is running! it’s just not doing anything useful

Arete · September 12, 2024, 5:30am

anyways I am aware that the sample isn’t working but I don’t know how to fix it

notblackorwhite · September 12, 2024, 5:35am

Uh try like thread_links = soup.find_all('a', class_='post-subject')

You may wanna double check the class name. I went from memory

notblackorwhite · September 12, 2024, 5:37am

Am I crazy or did it have you import re without using it lol

Arete · September 12, 2024, 5:37am

how do I do that? from previous conversation with the bot I understand it to involve inspect element in some fashion but I’m not clear on what I’m actually supposed to do

Arete · September 12, 2024, 5:38am

I think it did yeah

Arete · September 12, 2024, 5:39am

to be fair to the bot I totally did that sort of thing all the time in high school robotics-

notblackorwhite · September 12, 2024, 5:40am

You should be able to right-click a thread link, and see “inspect” in the context menu.

The class name would be what’s quoted in the class attribute. There may be multiple classes, and there may be other a elements on the page with that class besides the one you want, but I’d have to look myself to say for sure

Marshal · September 12, 2024, 5:41am

@tutuu 22

notblackorwhite · September 12, 2024, 5:43am

If you can’t figure it out tonight, I can help you tomorrow and I almost guarantee I can get it working in minutes. Lili could too and it’ll be up before me so you don’t even have to wait for me

Arete · September 12, 2024, 5:54am

it says <td class="post-subject vtop even">

does that mean I can do it with just thread_links = soup.find_all('a', class_='post-subject'), I don’t need to worry about vtop or even because those are separate classes?
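
For reference, a rough sketch of how the class matching behaves, assuming the thread link is nested inside the inspected td (the sample HTML and hrefs below are made up for illustration, not copied from the site). BeautifulSoup's class_ filter matches any single one of an element's classes, so vtop and even can be ignored. The catch is that the class sits on the td, not on the a, so searching for a tags with that class would likely come back empty.

```python
from bs4 import BeautifulSoup

# Made-up sample markup, modeled on the inspected <td class="post-subject vtop even">.
SAMPLE_HTML = """
<table>
  <tr><td class="post-subject vtop even"><a href="/posts/1">First thread</a></td></tr>
  <tr><td class="post-subject vtop odd"><a href="/posts/2">Second thread</a></td></tr>
  <tr><td class="post-time vtop even"><a href="/users/9">Not a thread</a></td></tr>
</table>
"""

soup = BeautifulSoup(SAMPLE_HTML, "html.parser")

# The class is on the <td>, not the <a>, so this finds nothing.
links_on_a = soup.find_all("a", class_="post-subject")

# class_ matches any one class, so "vtop" and "even"/"odd" can be ignored.
cells = soup.find_all("td", class_="post-subject")

# Pull the first link out of each matching cell, skipping cells without one.
thread_links = [cell.find("a") for cell in cells if cell.find("a") is not None]
hrefs = [link["href"] for link in thread_links]

print(len(links_on_a), hrefs)
```

If the real page is structured this way, soup.select('td.post-subject a') should be an equivalent one-liner.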

Arete · September 12, 2024, 5:56am

ok I ran it and got the error message

C:\Users\Francis>python C:\Users\Francis\Downloads\glowfic_scraper.py
Traceback (most recent call last):
  File "C:\Users\Francis\Downloads\glowfic_scraper.py", line 38, in <module>
    main()
  File "C:\Users\Francis\Downloads\glowfic_scraper.py", line 28, in main
    thread_links = soup.find_all('a', class_='post-subject')
                   ^^^^
NameError: name 'soup' is not defined

which I was going to say was weird since elsewhere I have soup = BeautifulSoup(response.text, 'html.parser'), but both of those are in the definitions of separate functions so maybe that’s why it’s having issues? iirc that’s one of the things Python’s annoying about
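
That guess about the two functions matches how Python scoping works: a name assigned inside one function is local to it and invisible to any other function. A minimal sketch of the problem and one common fix, returning the value and capturing it in the caller. The function names here are hypothetical and a plain string stands in for the real BeautifulSoup object, so this is not the actual script.

```python
def parse_page():
    # 'soup' is a local variable: it only exists inside parse_page().
    soup = "parsed page"  # stand-in for BeautifulSoup(response.text, 'html.parser')
    return soup  # returning it is what makes it available to the caller


def broken_main():
    parse_page()
    # The return value was discarded, and parse_page's local 'soup' is not
    # visible here, so this line raises NameError.
    return soup.upper()


def fixed_main():
    soup = parse_page()  # capture the return value under a local name
    return soup.upper()


try:
    broken_main()
except NameError as err:
    print("broken:", err)

print("fixed:", fixed_main())
```

The same idea applies to the scraper: whichever function builds soup can return it, and main() can either call that function and keep the result or take soup as a parameter.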

Squirrel2412 · September 12, 2024, 6:06am

Is response.text supposed to be a .txt file?

Squirrel2412 · September 12, 2024, 6:08am

Wait nvm i saw the full code

orangeandblack5 · September 12, 2024, 6:15am

that was me

orangeandblack5 · September 12, 2024, 6:15am

and yeah I mean GPT isn’t gonna do the whole thing for you but it can help point you in the right direction and take care of really menial stuff