Blog

Technical exploration and thoughts · 655 articles


Hadoop Cluster WordCount Distributed Computing Practice

Run WordCount end to end on a Hadoop cluster: upload files to HDFS, submit the MapReduce job, monitor its status in the YARN UI, and verify that the computation is actually distributed.

7/1/2024

big-data hadoop mapreduce hdfs

Hadoop JobHistoryServer Configuration and Log Aggregation

Configure the Hadoop JobHistoryServer to record MapReduce job execution history, enable YARN log aggregation, and view job details and logs via the Web UI.

7/1/2024

big-data hadoop mapreduce yarn

Hadoop Cluster SSH Passwordless Login Configuration and D...

Complete guide for Hadoop three-node cluster SSH passwordless login: generate RSA keys, distribute public keys, write rsync cluster distribution script, including pitfall notes and /etc/hosts confi...

6/30/2024

big-data hadoop linux distributed-system

Hadoop Cluster Startup and Web UI Verification

Complete startup process for a three-node Hadoop cluster: format the NameNode, start HDFS and YARN with start-dfs.sh and start-yarn.sh, and verify cluster status via the Web UI.

6/30/2024

big-data hadoop hdfs yarn distributed-system

Basic Environment Setup: Hadoop Cluster

Detailed tutorial on setting up Hadoop cluster environment on 3 cloud servers (2C4G configuration), including HDFS, MapReduce, YARN components introduction, Java and Hadoop environment configuratio...

6/28/2024

big-data hadoop java distributed-system

Hadoop Cluster XML Configuration Details

Detailed explanation of Hadoop cluster three-node XML configuration files: core-site.xml, hdfs-site.xml, mapred-site.xml, yarn-site.xml, including NameNode, DataNode, ResourceManager configuration ...

6/28/2024

big-data hadoop hdfs distributed-system

LangChain-26 Custom Agent Complete Tutorial Building a Cu...

This article demonstrates how to create a chat agent in Python using the LangChain library and the GPT-4 model by defining tool functions and integrating them with an LLM to achieve queries for informatio...

4/15/2024

langchain llm ai python

LangChain-24 AgentExecutor Comprehensive Guide

This article introduces how to use the LangChain library in Python for document retrieval: load web content, configure OpenAIEmbeddings, and integrate the GPT-3.5-turbo model for Q&A. It demonstrates h...

4/14/2024

langchain llm ai python

LangChain-25 ReAct Framework Detailed Explanation Integra...

This article introduces ReAct, a framework that uses logical reasoning and action sequences to achieve goal-oriented tasks through LLM decision-making and operations. The core components include Th...

4/14/2024

langchain llm ai python

LangChain-22 Text Embedding and FAISS Practical Explanation

This article introduces the key role of text embeddings in NLP, how to convert text into real-valued vectors that represent semantic relationships, and how to combine OpenAIEmbeddings and FAISS for eff...

4/13/2024

langchain llm ai python

LangChain-23 Vector AI Semantic Search System Vector Data...

This article introduces how to use Chroma vector database to process and retrieve high-dimensional vector embeddings from documents, vectorize them using...

4/13/2024

langchain llm ai python

LangChain-20 Document Loaders TextLoader, CSVLoader, PyPD...

This article introduces various document loaders provided by the LangChain library, such as TextLoader, CSVLoader, DirectoryLoader, etc., demonstrating how to load and process data in various formats.

4/12/2024

langchain llm ai python

LangChain Text Splitter: Character, Word, HTML and Code-b...

This article introduces various TextSplitters in the LangChain library, including character-based, word-based, HTML tag-based, and programming language-based splitters, as well as their application...

4/12/2024

langchain llm ai python

LangChain Cache Mechanism: InMemoryCache and SQLiteCache ...

LangChain provides a caching mechanism that can significantly reduce LLM call latency and cost. Its two core implementations are InMemoryCache (in-memory cache) and SQLiteCache (persistent cache).

4/11/2024

langchain llm ai python

LangChain-19 TokenUsage Callback Function Explained

Explains how to call an OpenAI GPT-3 model from Python through the LangChain library, and how to use the `get_openai_callback` function to track token usage while executing requests.

4/11/2024

langchain llm ai python

LangChain-16 Using Tools: Mastering LLM Tool Calling

LangChain is currently one of the most popular LLM application development frameworks, specifically designed for building intelligent assistants, automation...

4/10/2024

langchain llm ai python

LangChain-17 Function Calling AI Function Calling Explained

Function Calling is a core technology for Large Language Models (like GPT-4, Claude, Gemini) to interact with external systems. It enables AI to not only understand language but also execute tasks,...

4/10/2024

langchain llm ai python

LangChain-14 OpenAI Content Moderation (Moderation) Expla...

Content moderation is a core component of modern internet platform safety and compliance, used to identify, filter, and manage user-generated content (UGC) to prevent the spread of illegal, low-qua...

4/9/2024

langchain llm ai python

LangChain-15 Intelligent Knowledge Retrieval: AgentExecut...

Build an intelligent knowledge retrieval system using a Wikipedia search plugin, AgentExecutor, and LangChain tools. Covers agent initialization, tool binding, and multi-step reasoning workflows.

4/9/2024

langchain llm ai python

LangChain-12 Routing By Semantic Similarity

This article introduces a way to handle unexpected inputs in a program: use a large model (such as OpenAI's) to calculate the similarity between the query and a set of preset prompt templates, then route the query to the closest match.

4/8/2024

langchain llm ai python