You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Model Card for Solade

Model Details

Model Description

Языковая модель на 1.2 миллиардов параметр,

  • Developed by: DLMveloper
  • Model type: Decoder-only transformer (text generation)
  • Language(s): Russian, English, Kazakh
  • License: [The license is being examined by lawyers]

Model Sources

Uses

Direct Use

Генерация текста на русском, английском, казахском языках.

Out-of-Scope Use

Модель обучена на ограниченном объёме данных (300 шагов), не предназначена для высокоточных или критичных задач.

Bias, Risks, and Limitations

Модель обучена на небольшом количестве шагов и может выдавать несвязный или некорректный текст.

How to Get Started with the Model

Training Details

Training Data

Датасет: DLMveloper/DLM_DataSet (подвыборка ~20000 примеров)

Training Procedure

Training Hyperparameters

  • Training regime: ???????
  • Steps: ???
  • Batch size: ?
  • Learning rate: ?????
  • Sequence length: ???.

Speeds, Sizes, Times

  • Размер модели: ????? (??-bit quantized)

Technical Specifications

Model Architecture and Objective

  • Параметров: ??
  • Слоёв: ??
  • Hidden size: ????
  • Attention heads: ??
  • Intermediate size (FFN): ????
  • Vocab size: ???
  • Компоненты: ???????

Compute Infrastructure

Software

???????????

Downloads last month
216
Safetensors
Model size
1B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train DLMveloper/Solade

Space using DLMveloper/Solade 1