BERT-type: uncased_L-12_H-768_A-12
Batch_size = 8
BERT parameters:
learning rate: 1e-05
Fine-tune BERT: True
vocab size: 30522
hidden_size: 768
num_hidden_layer: 12
num_attention_heads: 12
hidden_act: gelu
intermediate_size: 3072
hidden_dropout_prob: 0.1
attention_probs_dropout_prob: 0.1
max_position_embeddings: 512
type_vocab_size: 2
initializer_range: 0.02
Load pre-trained parameters.
Seq-to-SQL: the number of final BERT layers to be used: 2
Seq-to-SQL: the size of hidden dimension = 100
Seq-to-SQL: LSTM encoding layer size = 2
Seq-to-SQL: dropout rate = 0.3
Seq-to-SQL: learning rate = 0.001
Killed
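A bare "Killed" with no Python traceback usually means the Linux OOM killer terminated the process: system RAM (not GPU memory) ran out, most likely while the pre-trained BERT weights or the training data were being loaded. One way to confirm is to log resident memory around the suspected allocations; the sketch below uses psutil (an assumption, not part of this repo), and dmesg should also show an oom-killer entry for the python process.

import os
import psutil  # assumed available: pip install psutil

def log_rss(tag: str) -> None:
    # Print the current resident set size (RSS) in GiB.
    rss_gib = psutil.Process(os.getpid()).memory_info().rss / 2**30
    print(f"[mem] {tag}: {rss_gib:.2f} GiB")

# Hypothetical call sites, to be placed around the repo's own loading code:
# log_rss("before loading BERT")
# model = load_bert(...)   # placeholder name for the repo's loader
# log_rss("after loading BERT")
# log_rss("after building data loaders")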
Out of memory, I guess.
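If it is indeed system RAM running out during training, the usual workarounds are a smaller batch size or gradient accumulation, which keeps the effective batch at 8 while holding only a couple of examples in memory per step. Below is a minimal, self-contained sketch of the accumulation pattern in plain PyTorch; the toy model, data, and loss are stand-ins, not this repo's API.

import torch
from torch import nn

# Toy stand-ins; the repo's real model and data loader would go here (assumption).
model = nn.Linear(10, 1)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
data = [(torch.randn(2, 10), torch.randn(2, 1)) for _ in range(8)]

accum_steps = 4          # 4 micro-batches of size 2 ~= one batch of 8
optimizer.zero_grad()
for i, (x, y) in enumerate(data):
    loss = nn.functional.mse_loss(model(x), y)
    (loss / accum_steps).backward()   # scale so accumulated gradients average
    if (i + 1) % accum_steps == 0:
        optimizer.step()
        optimizer.zero_grad()

Note that accumulation only reduces activation memory per step; if the kill happens while the pre-trained weights are being loaded, before training starts, the fix is more RAM or swap rather than a smaller batch.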