Table of Contents

Enum SentimentAnalysis.TrainingDataset

Namespace
LMKit.TextAnalysis
Assembly
LM-Kit.NET.dll

Enumeration representing the built-in training datasets available for fine-tuning language models (LLMs) for sentiment analysis using LMKit.

public enum SentimentAnalysis.TrainingDataset

Fields

KotziasKDD2015 = 0

A subset of the dataset from the paper "From Group to Individual Labels Using Deep Features" by Kotzias et al., presented at KDD 2015. This dataset contains only positive and negative sentiment samples in English.

Reference: From Group to Individual Labels Using Deep Features

LMKit2024_09_INT = 1

A dataset containing mostly synthetic samples generated using OpenAI's models. This dataset includes positive, negative, and neutral sentiment samples across various languages, designed for international applications.