(전영훈) YoungHoon 8/21/24 (전영훈) YoungHoon 8/21/24

Understanding Fine-Tuning of Large Language Models: A Comprehensive Overview

Fine-tuning Large Language Models (LLMs) is crucial for customizing AI to meet specific business needs. This blog delves into the two primary types of fine-tuning: instruction tuning, which enhances a model's ability to follow complex commands, and alignment tuning, which ensures outputs align with human values. By understanding these processes, businesses can effectively leverage AI for customer support, content creation, and more.

(전영훈) YoungHoon 6/26/24 (전영훈) YoungHoon 6/26/24

SAAS - Solving Ability Amplification Strategy for Enhanced Mathematical Reasoning in Large Language Models

Introducing "SAAS: Solving Ability Amplification Strategy for Enhanced Mathematical Reasoning in Large Language Models," a groundbreaking research paper by Upstage AI, Mathpresso Inc., and KT Corp. This novel approach leverages sequential learning, combining Chain-of-Thought (CoT) and Program-of-Thought (PoT) methodologies to significantly boost the mathematical reasoning and problem-solving skills of Large Language Models (LLMs). Our findings show that SAAS achieves state-of-the-art performance on benchmarks like GSM8K and MATH, outperforming larger models and setting new standards in AI-driven mathematical reasoning. Discover how SAAS can elevate your LLM’s capabilities.

(전영훈) YoungHoon 6/5/24 (전영훈) YoungHoon 6/5/24

Next Generation for AGI: Upstage’s On-Device LLM, WriteUp

Experience the power of AI at your fingertips with Upstage's on-device LLM, WriteUp. Enjoy AI assistance for writing tasks without the need for an internet connection, thanks to advanced model optimization and quantization technologies. Perfect for remote locations, WriteUp ensures data privacy and high performance on your personal device.

(전영훈) YoungHoon 5/22/24 (전영훈) YoungHoon 5/22/24

Introducing Solar Mini Chat ja: Expanding Language Support to Japanese

We are thrilled to announce that Solar mini chat now includes Japanese, alongside English and Korean. Specially fine-tuned for multi-turn chat, Solar mini chat ja excels in Japanese language interactions, offering high performance and an enhanced user experience. Ideal for applications demanding nuanced and context-aware communication, it surpasses many open-source models in key NLP tasks. Seamlessly integrate it with your existing API keys and elevate your Japanese chat applications!

(전영훈) YoungHoon 5/16/24 (전영훈) YoungHoon 5/16/24

Breaking Barriers: Revolutionize Your Work with Our Next-Level Embedding Model

Experience the Next Leap in Embedding Technology with Solar Embedding-1-Large: Our groundbreaking Solar Embedding-1-Large model is set to transform your work processes. With superior performance compared to OpenAI's models and a commitment to tackling even the toughest tasks, it's time to elevate your search systems and beyond.

(전영훈) YoungHoon 4/16/24 (전영훈) YoungHoon 4/16/24

Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework

Discover Evalverse: A groundbreaking framework revolutionizing Large Language Model evaluation. With its unified approach and user-friendly features, Evalverse simplifies assessment, making AI advancements inclusive and comprehensive. Explore its key features and architecture, and witness its practical application in our demonstrative video. Join us in driving innovation and accessibility in AI technology with Evalverse!

(최유정) Eujeong 3/26/24 (최유정) Eujeong 3/26/24

Open Source All About Data Processing, Dataverse

Dataverse is a freely-accessible open-source project designed to streamline the extract, transform, and load (ETL) pipeline using Python. In this post, we delve into the origins of this project and shed light on its future prospects in the realm of open-source data processing.

(최유정) Eujeong 3/8/24 (최유정) Eujeong 3/8/24

(Almost) Zero Hallucination with RAG and Groundedness Check

Unraveling the mechanics behind translating LLM outputs into leaderboard scores. Exploring a spectrum of evaluation mechanisms that span from automated metrics to human assessments rooted in real-world applicability.

(최유정) Eujeong 2/23/24 (최유정) Eujeong 2/23/24

LLM Evaluation Part2. Mechanics Behind LLM Scoring Systems

Unraveling the mechanics behind translating LLM outputs into leaderboard scores. Exploring a spectrum of evaluation mechanisms that span from automated metrics to human assessments rooted in real-world applicability.

(최유정) Eujeong 2/1/24 (최유정) Eujeong 2/1/24

LLM Evaluation Part1. What is a Benchmark Dataset?

Want to know why and how we evaluate LLM models?

Hailey(박성민) . 9/19/23 Hailey(박성민) . 9/19/23

[2023 AI KOREA GRAND PRIZE] Upstage Wins AI Technology Award (Minister of Science and ICT Award)

Upstage wins ‘AI Technology Award,’ the highest award given by the Minister of Science and ICT, at the 2023 AI Korea Awards Event!

종원 황 6/16/23 종원 황 6/16/23

Reinterpreting the History of NLP-based AI through a Data-Centric Perspective

What insights can be gained by examining natural language processing (NLP) through a data-centric perspective? Explore our blog post that delves into the 70-year history of AI, covering rule-based systems, machine learning, deep learning, and the recent emergence of large language models.

Hailey(박성민) . 4/12/23 Hailey(박성민) . 4/12/23

Data-Centric AI in the Real World

Just like a car needs fuel to move and a recipe requires ingredients to make a meal, artificial intelligence systems also need their own kind of fuel and materials, which is data. Explore the practical applications of data in the real world through this blog.

Hailey(박성민) . 2/21/23 Hailey(박성민) . 2/21/23

Until the birth of OCR that recognizes text (Upstage in-house OCR image data collection challenge)

Discover how Upstage builds its high-performance OCR solution, "Document AI," through our in-house image data collection event. Gain insights into the importance of data for AI model training and understand why it's a crucial component for success.

Overview

Solar Pro Preview: The most intelligent LLM on a single GPU

Upstage Document Parse: Let LLMs read your documents with speed and accuracy

Latest news

Understanding Fine-Tuning of Large Language Models: A Comprehensive Overview

SAAS - Solving Ability Amplification Strategy for Enhanced Mathematical Reasoning in Large Language Models

Next Generation for AGI: Upstage’s On-Device LLM, WriteUp

Introducing Solar Mini Chat ja: Expanding Language Support to Japanese

Breaking Barriers: Revolutionize Your Work with Our Next-Level Embedding Model

Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework

Open Source All About Data Processing, Dataverse

(Almost) Zero Hallucination with RAG and Groundedness Check

LLM Evaluation Part2. Mechanics Behind LLM Scoring Systems

LLM Evaluation Part1. What is a Benchmark Dataset?

[2023 AI KOREA GRAND PRIZE] Upstage Wins AI Technology Award (Minister of Science and ICT Award)

Reinterpreting the History of NLP-based AI through a Data-Centric Perspective

Data-Centric AI in the Real World

Until the birth of OCR that recognizes text (Upstage in-house OCR image data collection challenge)