getting loss as 'nan' after 1st epoch only? #47

Open
shubhamk16 opened this issue Jun 11, 2020 · 3 comments

Comments

shubhamk16 commented Jun 11, 2020

Hello guys,
Just like with GloVe, I created a dictionary of all the possible words, with words as keys and 768-dimensional BERT embedding vectors as values. But when I use this dictionary to train the model, the loss becomes NaN within the 1st epoch.

  1. How can I handle this problem?
  2. What are the possible reasons for getting a NaN loss?
  3. Is making a dictionary of embedding vectors like this a good approach? (A sketch of the setup is shown after this list.)
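For context, here is a minimal sketch of the kind of dictionary described above, assuming a recent Hugging Face transformers version and bert-base-uncased. The word list, the subword averaging, and all names here are illustrative assumptions, not the poster's actual code:

```python
import torch
from transformers import BertTokenizer, BertModel

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")
model.eval()

def word_vector(word: str) -> torch.Tensor:
    """Context-free 768-dim embedding: average of the word's subword vectors."""
    inputs = tokenizer(word, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**inputs)[0]  # last hidden state: (1, seq_len, 768)
    return hidden[0, 1:-1].mean(dim=0)  # drop [CLS]/[SEP], average subwords

# Hypothetical vocabulary; the real one would cover "all the possible words".
embedding_dict = {w: word_vector(w) for w in ["select", "where", "table"]}
```
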
@alkaideemo

I ran into a similar problem. The loss computation here is not numerically stable:

IRNet/src/model.py

Lines 308 to 309 in c329460

sketch_prob_var = torch.stack(
    [torch.stack(action_probs_i, dim=0).log().sum() for action_probs_i in action_probs], dim=0)

IRNet/src/model.py

Lines 479 to 480 in c329460

lf_prob_var = torch.stack(
    [torch.stack(action_probs_i, dim=0).log().sum() for action_probs_i in action_probs], dim=0)

I added a small number before the log operation, and the problem was solved.
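For illustration, a minimal sketch of that fix. The helper name and the epsilon value are assumptions, not taken from the actual change:

```python
import torch

EPS = 1e-8  # hypothetical constant; the actual value used is not given above

def stable_log_prob_sum(action_probs):
    """Per-example sum of log action probabilities, guarded against log(0)."""
    return torch.stack(
        [(torch.stack(probs_i, dim=0) + EPS).log().sum()
         for probs_i in action_probs],
        dim=0)

# Example: the second sequence contains an exact zero probability, which
# would produce -inf (and then NaN gradients) without the epsilon.
action_probs = [
    [torch.tensor(0.9), torch.tensor(0.5)],
    [torch.tensor(0.7), torch.tensor(0.0)],
]
print(stable_log_prob_sum(action_probs))
```
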

@liguozhanglearner

I don't understand how the loss function is computed here.

@ersaurabhverma

Try reducing the learning rate.
Your gradient is exploding due to a high learning rate.
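
A hypothetical sketch of that suggestion in PyTorch. The model, data, loss, and both learning-rate values are stand-ins, and the gradient-clipping line is an extra guard not mentioned in the comment above:

```python
import torch
import torch.nn as nn

# Stand-in model and data; the real IRNet model and loss would go here.
model = nn.Linear(768, 1)
x, y = torch.randn(4, 768), torch.randn(4, 1)

# Drop the learning rate by an order of magnitude, e.g. 1e-4 instead of 1e-3
# (both values are illustrative, not taken from the repo's config).
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

loss = nn.functional.mse_loss(model(x), y)
loss.backward()
# Gradient clipping is a common companion fix for exploding gradients.
torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
optimizer.step()
```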
