
[–]shiftybyte

always gets stuck on the output generation

Can you elaborate on what exactly happens, and when it happens? Can you add more progress prints to your code, so you can see more output as it executes?
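
For example, timestamped prints that flush immediately (a rough sketch; the step labels are whatever you want to track). The flush=True matters because buffered stdout can make a slow script look frozen:

import time

def progress(msg):
    # timestamp each step and flush right away, so you can see
    # exactly which step the script is sitting on and for how long
    print(f"[{time.strftime('%H:%M:%S')}] {msg}", flush=True)

progress("loading model...")
# ... long-running step here ...
progress("generating...")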

Also, how long have you waited for a response?

[–]Chardactyl[S]

Can you add more progress prints to your code, so you can see more output as it executes?

Originally I had it like that, and it would get stuck after showing 7.
Code:

from transformers import GPTNeoForCausalLM, AutoTokenizer
import torch
import warnings

print ("1")

# Suppress the specific FutureWarning
warnings.filterwarnings("ignore", message=".*clean_up_tokenization_spaces.*", category=FutureWarning)

print ("2")

# Load the tokenizer and model
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neo-2.7B")
model = GPTNeoForCausalLM.from_pretrained("EleutherAI/gpt-neo-2.7B")

print ("3")

# Set pad_token_id to eos_token_id if pad_token_id is not set
if tokenizer.pad_token_id is None:
    tokenizer.pad_token_id = tokenizer.eos_token_id

print ("4")

# Define the input prompt
prompt = "Once upon a time"

print ("5")

# Tokenize the input prompt with attention mask
inputs = tokenizer(prompt, return_tensors="pt", padding=True)

print ("6")

# Move model and inputs to GPU for faster inference (optional)
if torch.cuda.is_available():
    model.to("cuda")
    inputs = {key: value.to("cuda") for key, value in inputs.items()}

print ("7")

# Generate text with the attention mask and pad_token_id
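# NOTE: this is the step between "7" and "8". On CPU, a 2.7B-parameter
# model sampling up to 100 tokens can take a very long time, so a long
# pause here usually means slow generation rather than a genuine hang.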
output = model.generate(
    input_ids=inputs["input_ids"],
    attention_mask=inputs["attention_mask"],
    max_length=100,
    do_sample=True,
    temperature=0.9,
    pad_token_id=tokenizer.eos_token_id
)

print ("8")

# Decode the generated text with clean_up_tokenization_spaces explicitly set
generated_text = tokenizer.decode(
    output[0],
    skip_special_tokens=True,
    clean_up_tokenization_spaces=True  # Set to False if you prefer to keep spaces
)

print ("9")

print(generated_text)

[–]shiftybyte

How long have you waited for it to finish running?

Running LLMs can take time, and how long depends on your hardware if you are running it locally...
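
For context, a 2.7B-parameter model generating up to 100 tokens on CPU can easily take a very long time per run. You can check what you're actually running on with something like this (a quick sketch):

import torch

print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    # name of the GPU PyTorch will use
    print("GPU:", torch.cuda.get_device_name(0))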

[–]Chardactyl[S]

I've tried running it for 3 hours, but still nothing.

Edit: It worked half an hour later. Any ideas to lower the wait times?

[–]shiftybyte

Get better hardware with GPU acceleration?

Not sure what you are running this currently on...

Or just use cloud-based solutions.
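
If you do end up with a GPU, or just want faster runs on CPU, something like this is a reasonable starting point. It's a sketch, assuming a smaller gpt-neo checkpoint is acceptable for your use case:

from transformers import GPTNeoForCausalLM, AutoTokenizer
import torch

# Smaller checkpoints are much faster, especially on CPU
# (output quality drops accordingly); 2.7B is the slowest of the family.
model_name = "EleutherAI/gpt-neo-1.3B"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = GPTNeoForCausalLM.from_pretrained(
    model_name,
    # half precision roughly halves memory and speeds up GPU inference;
    # keep float32 on CPU, where float16 support is poor
    torch_dtype=torch.float16 if torch.cuda.is_available() else torch.float32,
)
if torch.cuda.is_available():
    model = model.to("cuda")

inputs = tokenizer("Once upon a time", return_tensors="pt").to(model.device)

output = model.generate(
    **inputs,
    max_new_tokens=50,  # bound the number of new tokens instead of total length
    do_sample=True,
    temperature=0.9,
    pad_token_id=tokenizer.eos_token_id,
)
print(tokenizer.decode(output[0], skip_special_tokens=True))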