Demystifying LLMs
Devendra Singh Chaplot
Mistral AI
Feb 13, 2024
Mistral AI
Co-Founders:
• Arthur Mensch, CEO: former AI researcher at DeepMind, Polytechnique alum
• Timothée Lacroix, CTO: former AI researcher at Meta, ENS alum
• Guillaume Lample, Chief Scientist: former AI researcher at Meta, Polytechnique alum
Releases
$500M+ funding, Offices in Paris/London/SF Bay Area
Mistral AI LLMs
Contents
• Stages of LLM Training:
• Pretraining
• Instruction-Tuning
• Learning from Human Preferences: DPO/RLHF
• Evaluation of LLMs
• Retrieval Augmented Generation (RAG)
• Recipe for RAG with code
Stages of LLM Training
1. Pretraining
2. Instruction-Tuning
3. Learning from Human Feedback
Pretraining
Training data example, the Mixtral 8x7B paper abstract:
"We introduce Mixtral 8x7B, a Sparse Mixture of Experts (SMoE) language model. Mixtral has the same architecture as Mistral 7B, with the difference that each layer is composed of 8 feedforward blocks (i.e. experts). For every token, at each layer, a router network selects two experts to process the current state and combine their outputs. Even though each token only sees two experts, the selected experts can be different at each timestep. As a result, each token has access to 47B parameters, but only uses 13B active parameters during inference. Mixtral was trained with a context size of 32k tokens and it outperforms or matches Llama 2 70B and GPT-3.5 across all evaluated benchmarks. In particular, Mixtral vastly outperforms Llama 2 70B on mathematics, code generation, and multilingual benchmarks. We also provide a model…"

Input: "We introduce Mixtral 8x7B , a Sparse" → Large Language Model, O(1-100B) parameters → Output: "introduce Mixtral 8x7B , a Sparse Mixture"
Pretraining
• Task: Next token prediction (see the loss sketch below)
• 1 token ~= 0.75 words
• Vocab size: O(10K) tokens
• Each token is represented by an integer

Input: "We introduce Mixtral" → Large Language Model (LLM) → Output: "introduce Mixtral 8x7B"
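Concretely, next-token prediction reduces to a cross-entropy loss over token ids shifted by one position. A minimal PyTorch sketch, an illustration rather than Mistral's actual training code; it assumes `model` maps a batch of token ids to per-position logits over the vocabulary:

import torch
import torch.nn.functional as F

def next_token_loss(model: torch.nn.Module, token_ids: torch.Tensor) -> torch.Tensor:
    """token_ids: (batch, seq_len) integer ids produced by the tokenizer."""
    inputs = token_ids[:, :-1]    # e.g. "We introduce Mixtral"
    targets = token_ids[:, 1:]    # e.g. "introduce Mixtral 8x7B" (shifted by one)
    logits = model(inputs)        # (batch, seq_len - 1, vocab_size)
    # Cross-entropy between the predicted distribution at each position
    # and the token that actually comes next in the training text.
    return F.cross_entropy(logits.reshape(-1, logits.size(-1)), targets.reshape(-1))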
Pretraining
• Why is it hard?
  • Time: Datasets are huge, O(1T) tokens
    • Preprocessing, Cleaning, Deduplication
    • More data might not lead to a better model
  • Money: O(1-100B) parameters
    • O(1-10K) GPUs for weeks or months
    • O($10-100M) per model
  • YOLO: decide model architecture, hyper-parameters, and data mixture for the "big run"
    • The best hyper-parameters for a smaller model might not be the best for a larger model

[Figures: Llama pretraining data mixture; Llama model sizes, architectures, and optimization hyper-parameters]
Can we use the Pretrained model?
Prompt:
Write a python function to find whether the input number is prime.
Response:
Example:
Input: 17
Output: True
Input: 15
Output: False
Approach:
1. Let's assume that the input number is n
2. Check if n is divisible by 1. If n is divisible by 1 then it is not a prime number.
3. Check if n is divisible by 2. If n is divisible by 2 then it is not a prime number.
Can we use the Pretrained model?
Prompt:
def is_prime(x: int):
    """
    takes as input an integer x. Returns True if x is prime and False otherwise
    """
Response:
    if x <= 1:
        return False
    for i in range(2, int(x ** 0.5) + 1):
        if x % i == 0:
            return False
    return True
Model knows the answer, but it is not aligned with human preferences.
Stages of LLM Training
1. Pretraining
2. Instruction-Tuning
3. Learning from Human Feedback
Instruction Tuning
Prompt: [INST] Write a python function to find whether the input number is prime. [/INST]
Response:
def is_prime(x: int):
    """
    takes as input an integer x. Returns True if x is prime and False otherwise
    """
    if x <= 1:
        return False
    for i in range(2, int(x ** 0.5) + 1):
        if x % i == 0:
            return False
    return True

Input: "[INST] Write … [/INST] def is_prime (x)" → Large Language Model, O(1-100B) parameters → Output: "def is_prime (x) :"
Instruction Fine-tuning
• Dataset:
  • Paired: (Prompt, Response)
  • O(10-100K) instructions
• Task:
  • Next word prediction (prompt tokens masked in the loss; see the sketch below)
• Compute:
  • O(1-100) GPUs
  • Few hrs/days

Instruction-tuning: "[INST] … [/INST]" → Large Language Model (LLM) → "def is_prime"
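A sketch of the instruction-tuning loss under the usual reading of "masked" on the slide: the same next-token objective as pretraining, but positions belonging to the prompt are excluded from the loss, so the model is only trained to produce the response. This is an illustration, not Mistral's training code:

import torch
import torch.nn.functional as F

def instruction_tuning_loss(model, token_ids, prompt_lengths):
    """token_ids: (batch, seq_len); prompt_lengths[i] = number of prompt tokens in row i."""
    inputs = token_ids[:, :-1]
    targets = token_ids[:, 1:].clone()
    for i, n_prompt in enumerate(prompt_lengths):
        # Targets at positions 0..n_prompt-2 predict prompt tokens; ignore them.
        targets[i, : n_prompt - 1] = -100   # -100 is skipped by cross_entropy
    logits = model(inputs)
    return F.cross_entropy(
        logits.reshape(-1, logits.size(-1)),
        targets.reshape(-1),
        ignore_index=-100,
    )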
Stages of LLM Training
1. Pretraining
2. Instruction-Tuning
3. Learning from Human Feedback
Human Preferences
Human preferences are cheaper/easier to collect than human annotations.
Prompt: [INST] Write a python function to find whether the input number is prime. [/INST]
Response 1:
def is_prime(x: int):
    """
    takes as input an integer x. Returns True if x is prime and False otherwise
    """
    if x <= 1:
        return False
    for i in range(2, int(x ** 0.5) + 1):
        if x % i == 0:
            return False
    return True
Response 2:
def is_prime(x: int):
    """
    takes as input an integer x. Returns True if x is prime and False otherwise
    """
    if x <= 1:
        return False
    for i in range(2, x):
        if x % i == 0:
            return False
    return True
Response 1 > Response 2 (both are correct, but Response 1 only checks divisors up to √x, so it is more efficient)
Reinforcement Learning from Human Feedback (RLHF)
[Deep Reinforcement Learning from Human Preferences. Christiano et al. 2017]
Direct Preference Optimization (DPO)
[Deep Reinforcement Learning from Human Preferences. Christiano et al. 2017]
[Direct Preference Optimization: Your Language Model is Secretly a Reward Model. Rafailov et al. 2023]
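For reference, here is the DPO objective from Rafailov et al. 2023 (not written out on the original slide). It trains the policy directly on preference pairs, with no explicit reward model or RL loop: y_w is the preferred response, y_l the dispreferred one, π_ref a frozen reference model (typically the instruction-tuned checkpoint), and β controls how far the policy may drift from the reference.

\mathcal{L}_{\mathrm{DPO}}(\pi_\theta;\, \pi_{\mathrm{ref}}) =
  -\,\mathbb{E}_{(x,\, y_w,\, y_l) \sim \mathcal{D}} \left[ \log \sigma\!\left(
      \beta \log \frac{\pi_\theta(y_w \mid x)}{\pi_{\mathrm{ref}}(y_w \mid x)}
    - \beta \log \frac{\pi_\theta(y_l \mid x)}{\pi_{\mathrm{ref}}(y_l \mid x)}
  \right) \right]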
Stages of LLM Training

             Pretraining                 Instruction-Tuning               Learning from Human Feedback
Dataset:     Raw text,                   Paired: (Prompt, Response),      Human preference data,
             few trillions of tokens     O(10-100K) instructions          O(10-100K)
Task:        Next word prediction        Next word prediction (masked)    RLHF/DPO
Compute:     O(1-10K) GPUs,              O(1-100) GPUs,                   O(1-100) GPUs,
             weeks/months of training    few hrs/days                     few hrs/days
Evaluation of LLMs
Evaluation of pretrained models
0-shot:
def is_prime(x: int):
    """
    takes as input an integer x.
    Returns True if x is prime and False otherwise
    """

3-shot:
## How old is Barack Obama in 2014?
Barack Obama is 57 years old in 2014.
## What is Barack Obama's birthday?
Barack Obama was born on August 4, 1961.
## What is the name of Barack Obama's wife?
Barack Obama's wife is Michelle Obama.
## How tall is Barack Obama?
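A sketch of how k-shot evaluation of a completion-only pretrained model typically works: build a prompt from k solved examples in the benchmark's format, append the test question, and score the model's completion. The "##" format mirrors the 3-shot example above; the naive first-line match below is an illustrative assumption, as real benchmarks use task-specific scoring:

def k_shot_prompt(examples: list[tuple[str, str]], question: str) -> str:
    # examples: list of (question, answer) pairs shown to the model as context.
    shots = "\n".join(f"## {q}\n{a}" for q, a in examples)
    return f"{shots}\n## {question}\n"

def score_completion(completion: str, reference: str) -> bool:
    # Naive scoring: does the model's first completed line match the reference?
    first_line = completion.strip().splitlines()[:1]
    return bool(first_line) and first_line[0].strip() == reference.strip()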
Evaluation of Instruction-tuned models
LMSYS Chatbot Arena Leaderboard
https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard
Evaluation of Instruction-tuned models
• Proxies for human evaluation:
  • MT-Bench:
    • Ask GPT-4 to score responses
    • 0.90 correlation with human preferences
  • AlpacaEval:
    • Compare win-rate against GPT-4 (v2)
    • 0.84 correlation with human preferences
Practical tips
• Proprietary vs Open-Source
  • For proprietary models:
    • Prompt Engineering: Few-shot prompting, Chain-of-thought (see the sketch below)
    • Retrieval Augmented Generation (RAG)
  • For open-source models:
    • Everything above
    • Task-specific fine-tuning and DPO: need data and a bit of compute
• Balance performance vs cost (training and inference)
  • Proprietary models have higher general-purpose performance
  • Open-source models can beat proprietary models on specific tasks with fine-tuning
  • Proprietary models typically have higher inference cost

[Figure: Open-source vs Proprietary models, price per M tokens: 0.42€ / 1.8€ / 7.5€]
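A toy illustration of the prompt-engineering techniques named above: few-shot examples combined with a chain-of-thought cue. The example question and the "Let's think step by step" phrasing are generic illustrations, not a Mistral-specific API or dataset:

FEW_SHOT_EXAMPLES = [
    (
        "A bat and a ball cost $1.10 in total. The bat costs $1.00 more "
        "than the ball. How much does the ball cost?",
        "Let's think step by step. If the ball costs x, the bat costs "
        "x + 1.00, so 2x + 1.00 = 1.10, which gives x = 0.05. "
        "The ball costs $0.05.",
    ),
]

def build_cot_prompt(question: str) -> str:
    # Worked examples show the model the expected format and reasoning style.
    shots = "\n\n".join(f"Q: {q}\nA: {a}" for q, a in FEW_SHOT_EXAMPLES)
    # End with the same cue so the model continues with step-by-step reasoning.
    return f"{shots}\n\nQ: {question}\nA: Let's think step by step."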
Retrieval Augmented Generation (RAG)
When do we need Retrieval Augmented Generation (RAG)?
• LLMs don't know everything; some tasks require task-specific knowledge
• Sometimes you want LLMs to answer queries based on a given data source, to reduce hallucinations
• The knowledge resource doesn't fit in the context window of the LLM
[Figure from https://lemaoliu.github.io/retrieval-generation-tutorial/]
Recipe for RAG
[Figure from https://gradientflow.substack.com/p/best-practices-in-retrieval-augmented]
Basic RAG code
https://docs.mistral.ai/guides/basic-RAG/
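The guide linked above walks through a basic RAG pipeline: chunk the knowledge source, embed the chunks, retrieve the chunks most similar to the query, and prepend them to the prompt. Below is a minimal, library-agnostic sketch of that recipe; `embed_texts` and `llm_generate` are hypothetical placeholders for your embedding and chat endpoints, not real Mistral client calls:

import numpy as np

def embed_texts(texts: list[str]) -> np.ndarray:
    """Hypothetical: returns one embedding vector per input text."""
    raise NotImplementedError("plug in your embedding API here")

def llm_generate(prompt: str) -> str:
    """Hypothetical: returns the LLM's completion for the prompt."""
    raise NotImplementedError("plug in your LLM API here")

def split_into_chunks(document: str, chunk_size: int = 512) -> list[str]:
    # Naive fixed-size character chunking; real pipelines often split on
    # paragraph or sentence boundaries instead.
    return [document[i:i + chunk_size] for i in range(0, len(document), chunk_size)]

def rag_answer(document: str, question: str, top_k: int = 2) -> str:
    # 1. Chunk the knowledge source and embed every chunk.
    chunks = split_into_chunks(document)
    chunk_embs = embed_texts(chunks)            # shape: (num_chunks, dim)
    # 2. Embed the question and score chunks by cosine similarity.
    q_emb = embed_texts([question])[0]
    sims = chunk_embs @ q_emb / (
        np.linalg.norm(chunk_embs, axis=1) * np.linalg.norm(q_emb) + 1e-9
    )
    top_chunks = [chunks[i] for i in np.argsort(-sims)[:top_k]]
    # 3. Prepend the retrieved context to the prompt and generate.
    context = "\n---\n".join(top_chunks)
    prompt = (
        f"Context information is below.\n{context}\n"
        f"Given the context information, answer the query.\n"
        f"Query: {question}\nAnswer:"
    )
    return llm_generate(prompt)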