
Training Processes

  • Extending the context window
  • PyTorch Fully Sharded Data Parallel (FSDP)
  • Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
  • YaRN: Efficient Context Window Extension of Large Language Models
  • Sliding Window Attention
  • LongRoPE
  • Reinforcement Learning
  • An introduction to reinforcement learning
  • Reinforcement Learning from Human Feedback (RLHF)
  • Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Continuum - Accelerated Artificial Intelligence

Copyright Continuum Labs - 2023