Back to tools
CodeT5
Code IntelligenceOpen SourceCode GenerationCode UnderstandingTransformer

About

CodeT5 is a family of open code large language models developed by Salesforce Research. It employs an encoder-decoder architecture that flexibly operates in different modes to support a wide range of code understanding and generation tasks. Trained on diverse pretraining tasks, including span denoising, causal language modeling, contrastive learning, and text-code matching, CodeT5 learns rich representations from both unimodal code data and bimodal code-text data. The models are available in various sizes, ranging from 220M to 16B parameters, catering to different computational needs and performance requirements.

Key Features

  • Supports multiple programming languages, including Python, Java, JavaScript, PHP, Ruby, Go, C, and C#.
  • Performs tasks such as code summarization, text-to-code generation, code-to-code translation, and code refinement.
  • Achieves state-of-the-art results on multiple code intelligence benchmarks.
  • Provides models of varying sizes to accommodate different use cases.
  • Open-source availability encourages community collaboration and research.

Pricing