Gleam Lab · Blog Archive

Blog Page 32

Technical exploration and engineering notes, 655 articles in total.

Gleam Lab technical blog cover: AI Engineering, Java backend, and long-form writing

All Articles Java243 Backend50 Microservices10 AI Engineering86 LLM35 Big Data271 Data Engineering57 Kubernetes / Cloud Native3 Real-time Voice1 Robotics40 Personal Growth29 More Tags →

Tutorial Series 2 min read Big Data Engineering

Hadoop Cluster WordCount Distributed Computing Practice

Complete WordCount execution on Hadoop cluster: upload files to HDFS, submit MapReduce job, view running status through YARN UI, verify true distributed computing.

7/1/2024

big-datahadoopmapreduce +1

Tutorial Series 2 min read Big Data Engineering

Hadoop JobHistoryServer Configuration and Log Aggregation

Configure Hadoop JobHistoryServer to record MapReduce job execution history, enable YARN log aggregation, view job details and logs via Web UI.

7/1/2024

big-datahadoopmapreduce +1

Tutorial Series 2 min read Big Data Engineering

Hadoop Cluster SSH Passwordless Login Configuration and Distribution Script

Complete guide for Hadoop three-node cluster SSH passwordless login: generate RSA keys, distribute public keys, write rsync cluster distribution script.

6/30/2024

big-datahadooplinux +1

Tutorial Series 2 min read Big Data Engineering

Hadoop Cluster Startup and Web UI Verification

Complete startup process for Hadoop three-node cluster: format NameNode, start HDFS and YARN, verify cluster status via Web UI, including start-dfs.sh and start-yarn.

6/30/2024

big-datahadoophdfs +2

Tutorial Series 2 min read Big Data Engineering

Basic Environment Setup: Hadoop Cluster

This article is migrated from Juejin. Original link: Big Data 01 - Basic Environment Setup

6/28/2024

big-datahadoopjava +1

Tutorial Series 2 min read Big Data Engineering

Hadoop Cluster XML Configuration Details

Detailed explanation of Hadoop cluster three-node XML configuration files: core-site.xml, hdfs-site.xml, mapred-site.xml, yarn-site.

6/28/2024

big-datahadoophdfs +1

Tutorial Series 4 min read LLM Application Development

LangChain-26 Custom Agent Complete Tutorial: Building a Custom Agent

A Custom Agent refers to an intelligent agent program customized by users based on specific requirements, which can execute specific tasks or workflows.

4/15/2024

langchainllmai +1

Tutorial Series 2 min read LLM Application Development

LangChain-24 AgentExecutor Comprehensive Guide

This article introduces how to use the Langchain library in Python for document retrieval, load web content, configure OpenAIEmbeddings, and integrate GPT-3.

4/14/2024

langchainllmai +1

Tutorial Series 2 min read LLM Application Development

LangChain-25 ReAct Framework Detailed Explanation and Integration Practice

This article introduces ReAct, a framework that uses logical reasoning and action sequences to achieve goal-oriented tasks through LLM decision-making and operations.

4/14/2024

langchainllmai +1

Tutorial Series 4 min read LLM Application Development

LangChain-22 Text Embedding and FAISS Practical Explanation

Text Embedding involves the process of mapping high-dimensional data (such as text, images, etc.) to lower-dimensional spaces.

4/13/2024

langchainllmai +1

Tutorial Series 3 min read LLM Application Development

LangChain-23 Vector AI Semantic Search System: Vector Databases and Retrieval

Vector Storage, also known as Vector Database, is a database system specifically optimized for storing and retrieving high-dimensional vector data.

4/13/2024

langchainllmai +1

Tutorial Series 3 min read LLM Application Development

LangChain-20 Document Loaders: TextLoader, CSVLoader, PyPDFLoader and More

This article introduces various document loaders provided by the LangChain library, such as TextLoader, CSVLoader, DirectoryLoader, etc., demonstrating how to load and pr...

4/12/2024

langchainllmai +1

Tutorial Series 2 min read LLM Application Development

LangChain Text Splitter: Character, Word, HTML and Code-based Splitting

This article introduces various TextSplitters in the LangChain library, including character-based, word-based, HTML tag-based, and programming language-based splitters...

4/12/2024

langchainllmai +1

Tutorial Series 3 min read LLM Application Development

LangChain Cache Mechanism: InMemoryCache and SQLiteCache Explained

LangChain provides a comprehensive caching mechanism to significantly reduce LLM call latency and costs. Its core includes InMemoryCache (in-memory cache) and SQLiteCache...

4/11/2024

langchainllmai +1

Tutorial Series 3 min read LLM Application Development

LangChain-19 TokenUsage Callback Function Explained

Explains how to integrate OpenAI GPT-3 model in Python through LangChain library, demonstrating how to use the getopenaicallback function to obtain callbacks and execute...

4/11/2024

langchainllmai +1

Tutorial Series 3 min read LLM Application Development

LangChain-16 Using Tools: Mastering LLM Tool Calling

LangChain is a powerful open-source framework designed to help developers more efficiently build and deploy applications based on Large Language Models (LLMs).

4/10/2024

langchainllmai +1

Tutorial Series 3 min read LLM Application Development

LangChain-17 Function Calling AI Function Calling Explained

Function Calling is a core technology for Large Language Models (like GPT-4, Claude, Gemini) to interact with external systems.

4/10/2024

langchainllmai +1

Tutorial Series 5 min read LLM Application Development

LangChain-14 OpenAI Content Moderation (Moderation) Explained

Moderation refers to the process of reviewing and managing user-generated content (UGC) through manual or automated means.

4/9/2024

langchainllmai +1

Tutorial Series 2 min read LLM Application Development

LangChain-15 Intelligent Knowledge Retrieval: AgentExecutor Practice

Build an intelligent knowledge retrieval system using Wikipedia search plugin, AgentExecutor, and LangChain tools. Covers agent initialization, tool binding...

4/9/2024

langchainllmai +1

Tutorial Series 4 min read LLM Application Development

LangChain-12 Routing By Semantic Similarity

This article introduces a method using large models (like OpenAI) and Prompt templates to handle unexpected inputs in program design by calculating the similarity between...

4/8/2024

langchainllmai +1