LogoLogo
Ctrlk
Continuum WebsiteContinuum ApplicationsContinuum KnowledgeAxolotl Platform
  • Continuum
  • Data
    • Datasets
  • MODELS
    • Foundation Models
  • Training
    • The Fine Tuning Process
      • Why fine tune?
      • Tokenization
      • Parameter Efficient Fine Tuning
      • Hyperparameters
      • Training Processes
        • Extending the context window
        • PyTorch Fully Sharded Data Parallel (FSDP)
        • Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
        • YaRN: Efficient Context Window Extension of Large Language Models
        • Sliding Window Attention
        • LongRoPE
        • Reinforcement Learning
        • An introduction to reinforcement learning
        • Reinforcement Learning from Human Feedback (RLHF)
        • Direct Preference Optimization: Your Language Model is Secretly a Reward Model
  • INFERENCE
    • Why is inference important?
  • KNOWLEDGE
    • Vector Databases
    • Retrieval Augmented Generation
    • Semantic Routing
    • Resource Description Framework (RDF)
  • AGENTS
    • What is agency?
  • Regulation and Ethics
    • Regulation and Ethics
  • DISRUPTION
    • Data Architecture
    • Search
    • Recommendation Engines
    • Logging
  • Infrastructure
    • The modern data centre
    • Servers and Chips
    • Networking and Connectivity
    • Data and Memory
    • Libraries and Complements
    • Vast Data Platform
    • Storage
Powered by GitBook
On this page

Was this helpful?

  1. Training
  2. The Fine Tuning Process

Training Processes

Extending the context windowPyTorch Fully Sharded Data Parallel (FSDP)Train Short, Test Long: Attention with Linear Biases Enables Input Length ExtrapolationYaRN: Efficient Context Window Extension of Large Language ModelsSliding Window AttentionLongRoPEReinforcement LearningAn introduction to reinforcement learningReinforcement Learning from Human Feedback (RLHF)Direct Preference Optimization: Your Language Model is Secretly a Reward Model
PreviousCachingNextExtending the context window

Was this helpful?

LogoLogo

Continuum - Accelerated Artificial Intelligence

  • Continuum Website
  • Axolotl Platform

Copyright Continuum Labs - 2023