Encoder-Decoder Neural-Machine-Translation

The implementation of the neural machine translation using tensorflow.

Run

You can directly run train_seq2seq_w_gru.py or train_seq2seq_w_lstm.py, depends on which neural network you want to use. GRU may be a little bit faster. The script contains data preprocessng, training, prediction on each epoch and evaluation with bleu score.

Required packages:

numpy 1.19.5
nltk 3.5
tensorflow 2.4.1

Dataset

The dataset "deu.txt" contains 208 486 pairs of English-German translations. It can be downloaded from here http://www.manythings.org/anki/ The project should work also for other languages. If there is any problem with encoding, encoding should be individually adjusted. The encoding type for a specific dataset can be checked using chardet library https://chardet.readthedocs.io/en/latest/usage.html

Data preprocessing

Before feeding the data into the neural network we must preprocess it: preprocess_data.py -> preprocess() The preprocess script involved:

creating encoder_input, decoder_input and decoder_output inputs
cleaning
tokenizing
creating word2idx_inputs, word2idx_inputs dictionaries
padding
shuffling
creating tensorflow's datasets from encoder_input, decoder_input, decoder_output
splitting into test/training set
creating batches within sets

Training the Encoder-Decoder LSTM model

Run train_lstm.py to train the model. Model subclassing: Encoder_LSTM.py, Decoder_LSTM.py. The prediction is picked using "Greedy Decoding".

Training the Encoder-Decoder GRU model

Run train_gru.py to train the model. Model subclassing: Encoder.py, Decoder.py

Attention Layer

You can run train_seq2seq_w_attn.py to train LSTM encoder-decoder model with custom attention layer (AttentionLayer.py). Training is very slow.

Model evaluation

Model is evaualte after the each epoch during the training, on the test data. For the evaluation was used a common machine translation metrc: BLEU("Bilingual Evaluation Understudy")

Test

After training the model is saved and can be tested using test_translation.py. The english input sentence must be specified as a "sequence" string variable inside the script.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.idea		.idea
AttentionLayer.py		AttentionLayer.py
DecoderAttentionLSTM.py		DecoderAttentionLSTM.py
DecoderGRU.py		DecoderGRU.py
DecoderLSTM.py		DecoderLSTM.py
EncoderGRU.py		EncoderGRU.py
EncoderLSTM.py		EncoderLSTM.py
README.md		README.md
data_preprocessing.py		data_preprocessing.py
deu.txt		deu.txt
evaluate.py		evaluate.py
loss.py		loss.py
prediction.py		prediction.py
requirements.txt		requirements.txt
test_translation.py		test_translation.py
train_seq2seq_w_attn.py		train_seq2seq_w_attn.py
train_seq2seq_w_gru.py		train_seq2seq_w_gru.py
train_seq2seq_w_lstm.py		train_seq2seq_w_lstm.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Encoder-Decoder Neural-Machine-Translation

Run

Required packages:

Dataset

Data preprocessing

Training the Encoder-Decoder LSTM model

Training the Encoder-Decoder GRU model

Attention Layer

Model evaluation

Test

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Encoder-Decoder Neural-Machine-Translation

Run

Required packages:

Dataset

Data preprocessing

Training the Encoder-Decoder LSTM model

Training the Encoder-Decoder GRU model

Attention Layer

Model evaluation

Test

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages