Skip to content
#

attention-model

Here are 236 public repositories matching this topic...

A TensorFlow implementation of the Transformer model for machine translation tasks. This package includes data loading, model definition, and training scripts for translating Portuguese to English using the TED Talks dataset. The repository provides a complete pipeline from preprocessing the data to training and testing the model.

  • Updated Jul 7, 2024
  • Jupyter Notebook

Seq2SeqSharp is a tensor based fast & flexible deep neural network framework written by .NET (C#). It has many highlighted features, such as automatic differentiation, different network types (Transformer, LSTM, BiLSTM and so on), multi-GPUs supported, cross-platforms (Windows, Linux, x86, x64, ARM), multimodal model for text and images and so on.

  • Updated Jun 6, 2024
  • C#

Improve this page

Add a description, image, and links to the attention-model topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the attention-model topic, visit your repo's landing page and select "manage topics."

Learn more