Blog

Technical exploration and thoughts · 655 articles

All big-datajavaaiartificial-intelligenceprogrammer-lifemachine-learningmysqldata-engineeringbackenddistributeddata-warehouseflinkarchitecturepythonroboticssparkhivellmdistributed-systemkafkadatabasescalaembodied-aihdfsdeep-learningspringmessage-queuelangchainsystem-architecturemybatisperformance-optimizationelasticsearchmongodbhealthredisspring-bootrabbitmqmqhadoopelkflumestream-processingtransactionmessagingrpctutorialsklearncachingcachedubbojava-rabbitmqclickhousehbasekylinneo4jmicroservicessqlindextomcatprogrammermultimodalzookeeperdruidcanalmllibormiocnutritionlarge-modelrobot-armteslaindie-devnginxdataxshardingshardingspherefastdfsrocketmqtime-managementapplicationscareer-growthdockeretlguavajava-rocketmqoptimizationlearningquantizationdeploymentkudulogstashdecision-treesqoopairflowrealtime-warehousemycatstorage-engineconsistencyfat-lossgptproduct-managercoffeebusiness-analysisautomationalgorithmcareer-and-growthmiddlewarecomputer-visionautonomous-drivingfsdqwenmapreducecrudmonitoringdatabase-shardingdistributed-transactionconcurrencytransaction-pitfallsgraph-databasememcachednettyinnodbsalarycareer-developmentcold-showerrunningefficiencyluckinindustrymedicalindustriallfplfp-batterybatteryevfitnesscareer-personal-growthocrdeepseekdeepseek-ocromnicloud-nativeyarndatastreamjdbcolapknnlinear-regressionnumpyzipper-tablegriffindevopskubernetesdata-mappingdesign-patternshigh-availabilityread-write-separationsharding-jdbcsagasecurityreplica-setcqlsource-code-analysisevcacheservletaopload-balancinghandwrittenniomindfulnessmeditationexercisereinforcement-learningagentconflictevaluationmoney-managementconsumptionsavingssocial-mediadatingmemoryprice-warcottiptqqatqloraqwen2.5-vlmultivitamincalciumevolutiontechnologyindustrial-robotagriculturehardwaresimulationroslarge-language-modeldegradationslamvisual-inspectionprogramming-languagelinuxwindowraftkibanaaggregationregularizationlogistic-regressionprometheusexporteratlasstate-managementmavenacidannotation-developmentmaster-slave-replicationflexible-transactionxacap2pc3pcbsonexplainb+treeslow-queryauthenticationclusterossaliyunsource-codeasyncnetflixjmspaxosrmiengineeringphysiologyhot-showerpractical-guidemuscle-buildingtransformertensorflowreportstechnical-sharingproductentrepreneurshipmethodologyteam-collaborationconflict-resolutioncollaborationgtdtoolsusage-timehealth-managementchina-usculturemarriagepartnercoffee-beverage-trendhomemade-coffeetasteperformancefine-tuningblip-2minigpt-4llavaalibabavitaminsfish-oilvitamin-cironfolatechronic-diseasesupplementstraditional-chinese-medicinewestern-medicineintegrated-medicinedevelopment-historytech-evolutionlakehousedata-meshserverlesstalenttech-selectionhistoryunimatehydraulic-driveai-collaborationcategoriesservice-robothumanoid-robotlogisticscareerskillstrendsservicescaracobotmotorreducersensorplcmpccontroltrajectory-planningvisioncore-technologyperceptiondecision-makinghomedatamarketchallengescommercializationfuture-trendsmeta-learninglifestylenmc-batterybody-fat-percentagebody-shapingmuscle-gainstrength-trainingbody-fatmetabolismsympathetic-nerveparasympathetic-nerveautonomic-nervous-systemhrvtesting-platformapi-integrationautomotive3dmodel-yopen-sourceimitation-learningvisual-algorithmsresearchjava-21kotlingolangrustjavascriptnodek8sgeminicepsourcesinkdatasetmergetreeik-analyzerdslterm-queryfilterinverted-indexnrtgrokfilebeattezdata-miningcross-validationnormalizationevaluation-metricsridge-regressionlassogradient-descentgrafanavisualizationodsscddimension-tabledwddwsadsrealtimememory-managementparallelismharborcontaineresp32home-assistantjenkinsgitlabcicdessaywebsiteastrofrontendxml-mappingdynamic-sqlsqlsessionhigh-concurrencymhafailoverdistributed-primary-keyscalingbinding-tablessql-optimizationbinding-tabletccseatadata-maskingdistributed-databasesharding-proxysharding-strategye-r-shardingconfiguration-filetransaction-isolation-levelschema.xmlpropagationdeclarative-transactionprogrammatic-transactiontransactionalplugindatabase-operationsnosqljsonpipelinepaginationwriteconcernpagehelpergeneric-mapperb-treeuse-casesselection-guidetemplaterepositorywiredtigerinmemorycontainerizationdata-modelingembeddedreferenceoplogelectionpermissionssharded-clustergraph-theoryeuler-pathproxy-patternembedded-databasebackupaccess-controldynamic-proxycloud-storagelruconcurrenthashmapoomdistributed-cachespymemcachedactivemqblockingqueuemessage-storagequeue-indexerlanghandwritten-frameworkjdkreverse-proxyprocessconfigurationclass-loadingssljvmioheartbeat-detectionspiroutingstorage-structureundoredothread-modeltablespacebinlogreplicationclustered-indexlockmvccsortingpipofflinepandasvoice

Hadoop Cluster WordCount Distributed Computing Practice

Complete WordCount execution on Hadoop cluster: upload files to HDFS, submit MapReduce job, view running status through YARN UI, verify true distributed computing.

Hadoop JobHistoryServer Configuration and Log Aggregation

Configure Hadoop JobHistoryServer to record MapReduce job execution history, enable YARN log aggregation, view job details and logs via Web UI.

Hadoop Cluster SSH Passwordless Login Configuration and D...

Complete guide for Hadoop three-node cluster SSH passwordless login: generate RSA keys, distribute public keys, write rsync cluster distribution script, including pitfall notes and /etc/hosts confi...

Hadoop Cluster Startup and Web UI Verification

Complete startup process for Hadoop three-node cluster: format NameNode, start HDFS and YARN, verify cluster status via Web UI, including start-dfs.sh and start-yarn.sh usage.

Basic Environment Setup: Hadoop Cluster

Detailed tutorial on setting up Hadoop cluster environment on 3 cloud servers (2C4G configuration), including HDFS, MapReduce, YARN components introduction, Java and Hadoop environment configuratio...

Hadoop Cluster XML Configuration Details

Detailed explanation of Hadoop cluster three-node XML configuration files: core-site.xml, hdfs-site.xml, mapred-site.xml, yarn-site.xml, including NameNode, DataNode, ResourceManager configuration ...

LangChain-26 Custom Agent Complete Tutorial Building a Cu...

This article demonstrates how to create a chat agent using the Langchain library and GPT-4 model in Python by defining tool functions and integrating them with LLM to achieve queries for informatio...

LangChain-24 AgentExecutor Comprehensive Guide

This article introduces how to use the Langchain library in Python for document retrieval, load web content, configure OpenAIEmbeddings, and integrate GPT-3.5-turbo model for Q&A. It demonstrates h...

LangChain-25 ReAct Framework Detailed Explanation Integra...

This article introduces ReAct, a framework that uses logical reasoning and action sequences to achieve goal-oriented tasks through LLM decision-making and operations. The core components include Th...

LangChain-22 Text Embedding and FAISS Practical Explanation

This article introduces the key role of TextEmbedding in NLP, how to convert text into real number vectors to represent semantic relationships, and how to combine OpenAIEmbeddings and FAISS for eff...

LangChain-23 Vector AI Semantic Search System Vector Data...

This article introduces how to use Chroma vector database to process and retrieve high-dimensional vector embeddings from documents, vectorize them using...

LangChain-20 Document Loaders TextLoader, CSVLoader, PyPD...

This article introduces various document loaders provided by the LangChain library, such as TextLoader, CSVLoader, DirectoryLoader, etc., demonstrating how to load and process data in various formats.

LangChain Text Splitter: Character, Word, HTML and Code-b...

This article introduces various TextSplitters in the LangChain library, including character-based, word-based, HTML tag-based, and programming language-based splitters, as well as their application...

LangChain Cache Mechanism: InMemoryCache and SQLiteCache ...

LangChain provides a comprehensive caching mechanism to significantly reduce LLM call latency and costs. Its core includes InMemoryCache (in-memory cache) and SQLiteCache (persistent cache).

LangChain-19 TokenUsage Callback Function Explained

Explains how to integrate OpenAI GPT-3 model in Python through LangChain library, demonstrating how to use the `get_openai_callback` function to obtain callbacks and execute requests.

LangChain-16 Using Tools: Mastering LLM Tool Calling

LangChain is currently one of the most popular LLM application development frameworks, specifically designed for building intelligent assistants, automation...

LangChain-17 Function Calling AI Function Calling Explained

Function Calling is a core technology for Large Language Models (like GPT-4, Claude, Gemini) to interact with external systems. It enables AI to not only understand language but also execute tasks,...

LangChain-14 OpenAI Content Moderation (Moderation) Expla...

Content moderation is a core component of modern internet platform safety and compliance, used to identify, filter, and manage user-generated content (UGC) to prevent the spread of illegal, low-qua...

LangChain-15 Intelligent Knowledge Retrieval: AgentExecut...

Build an intelligent knowledge retrieval system using Wikipedia search plugin, AgentExecutor, and LangChain tools. Covers agent initialization, tool binding, and multi-step reasoning workflows.

LangChain-12 Routing By Semantic Similarity

This article introduces a method using large models (like OpenAI) and Prompt templates to handle unexpected inputs in program design by calculating the similarity between queries and preset templates.