Blog

Technical exploration and thoughts · 655 articles

All big-datajavaartificial-intelligencemysqldata-engineeringbackenddata-warehouseflinkpythonroboticssparkhivellmmachine-learningdistributeddistributed-systemkafkaaiprogrammer-lifehdfsembodied-aideep-learningdatabasespringlangchainscalamybatiselasticsearchmongodbsystem-architectureperformance-optimizationmessage-queuehealtharchitectureredisspring-bootrabbitmqmqhadoopflumestream-processingtransactionelkmessagingrpctutorialsklearncachingcachedubboclickhousehbasejava-rabbitmqkylinneo4jmicroservicessqlindextomcatprogrammermultimodalzookeeperdruidcanalmllibormiocnutritionrobot-armteslaindie-devnginxdataxshardingshardingspherefastdfsrocketmqtime-managementcareer-growthdockeretlguavaoptimizationlearningapplicationslarge-modelquantizationkudulogstashdecision-treesqoopairflowrealtime-warehousemycatstorage-engineconsistencyfat-lossgptproduct-managercoffeebusiness-analysiscareer-and-growthmiddlewarecomputer-visionautonomous-drivingfsdqwenmapreducedeploymentcrudmonitoringdatabase-shardingdistributed-transactionconcurrencytransaction-pitfallsgraph-databasememcachednettyinnodbsalarycareer-developmentcold-showerrunningefficiencyluckinindustrymedicalindustrialautomationalgorithmlfplfp-batterybatteryevfitnesscareer-personal-growthocrdeepseekdeepseek-ocromniprogramming-languagecloud-nativeyarndatastreamjdbcolapknnlinear-regressionzipper-tablegriffindata-mappingdesign-patternshigh-availabilityread-write-separationsharding-jdbcsagasecurityreplica-setcqlsource-code-analysisevcacheservletaopload-balancinghandwrittenniomindfulnessmeditationreinforcement-learningagentconflictevaluationmoney-managementconsumptionsavingssocial-mediadatingmemoryprice-warcottiptqqatqloraqwen2.5-vlmultivitamincalciumevolutionindustrial-robotagriculturehardwaresimulationroslarge-language-modeldegradationslamresearchlinuxwindowraftkibanaaggregationnumpyregularizationlogistic-regressionprometheusexporteratlasstate-managementdevopskubernetesmavenacidannotation-developmentmaster-slave-replicationflexible-transactionxacap2pc3pcbsonexplainb-plus-treeslow-queryauthenticationclusterossaliyunsource-codeasyncnetflixjmsjava-rocketmqpaxosrmiengineeringphysiologyhot-showerpractical-guideexercisemuscle-buildingtransformertensorflowreportstechnical-sharingproductentrepreneurshipmethodologyteam-collaborationconflict-resolutioncollaborationgtdtoolsusage-timehealth-managementchina-usculturemarriagepartnercoffee-beverage-trendhomemade-coffeetasteperformancefine-tuningblip-2minigpt-4llavaalibabavitaminsfish-oilvitamin-cironfolatechronic-diseasesupplementstraditional-chinese-medicinewestern-medicineintegrated-medicinedevelopment-historytech-evolutionlakehousedata-meshserverlesstalenttech-selectionhistoryunimatetechnologyhydraulic-driveai-collaborationcategoriesservice-robothumanoid-robotlogisticscareerskillstrendsservicescaracobotmotorreducersensorplcmpccontroltrajectory-planningvisioncore-technologyperceptiondecision-makinghomedatamarketchallengescommercializationfuture-trendsmeta-learninglifestylenmc-batterybody-fat-percentagebody-shapingmuscle-gainstrength-trainingbody-fatmetabolismsympathetic-nerveparasympathetic-nerveautonomic-nervous-systemhrvtesting-platformapi-integrationautomotive3dmodel-yvisual-inspectionopen-sourceimitation-learningjava-21golangk8sgeminicepsourcesinkdatasetmergetreeik-analyzerdslterm-queryfilterinverted-indexnrtgrokfilebeattezdata-miningcross-validationnormalizationevaluation-metricsridge-regressionlassogradient-descentgrafanavisualizationodsscddimension-tabledwddwsadsrealtimememory-managementparallelismharborcontaineresp32home-assistantjenkinsgitlabcicdessaywebsiteastrofrontendxml-mappingdynamic-sqlsqlsessionhigh-concurrencymhafailoverdistributed-primary-keyscalingbinding-tablessql-optimizationbinding-tabletccseatadata-maskingdistributed-databasesharding-proxysharding-strategye-r-shardingconfiguration-filetransaction-isolation-levelschema.xmlpropagationdeclarative-transactionprogrammatic-transactiontransactionalplugindatabase-operationsnosqljsonpipelinepaginationwriteconcernpagehelpergeneric-mapperb-treeuse-casesselection-guidetemplaterepositorywiredtigerinmemorycontainerizationdata-modelingembeddedreferenceoplogelectionpermissionssharded-clustergraph-theoryeuler-pathproxy-patternembedded-databasebackupaccess-controldynamic-proxycloud-storagelruconcurrenthashmapoomdistributed-cachespymemcachedactivemqblockingqueuemessage-storagequeue-indexerlanghandwritten-frameworkjdkreverse-proxyprocessconfigurationclass-loadingssljvmioheartbeat-detectionspiroutingstorage-structureundoredothread-modeltablespacebinlogreplicationclustered-indexlockmvccsortingpipofflinevoice

Hadoop Cluster WordCount Distributed Computing Practice

Complete WordCount execution on Hadoop cluster: upload files to HDFS, submit MapReduce job, view running status through YARN UI, verify true distributed computing.

Hadoop JobHistoryServer Configuration and Log Aggregation

Configure Hadoop JobHistoryServer to record MapReduce job execution history, enable YARN log aggregation, view job details and logs via Web UI.

Hadoop Cluster SSH Passwordless Login Configuration and Distribution Script

Complete guide for Hadoop three-node cluster SSH passwordless login: generate RSA keys, distribute public keys, write rsync cluster distribution script.

Hadoop Cluster Startup and Web UI Verification

Complete startup process for Hadoop three-node cluster: format NameNode, start HDFS and YARN, verify cluster status via Web UI, including start-dfs.sh and start-yarn.

Basic Environment Setup: Hadoop Cluster

This article is migrated from Juejin. Original link: Big Data 01 - Basic Environment Setup

Hadoop Cluster XML Configuration Details

Detailed explanation of Hadoop cluster three-node XML configuration files: core-site.xml, hdfs-site.xml, mapred-site.xml, yarn-site.

LangChain-26 Custom Agent Complete Tutorial: Building a Custom Agent

A Custom Agent refers to an intelligent agent program customized by users based on specific requirements, which can execute specific tasks or workflows.

LangChain-24 AgentExecutor Comprehensive Guide

This article introduces how to use the Langchain library in Python for document retrieval, load web content, configure OpenAIEmbeddings, and integrate GPT-3.

LangChain-25 ReAct Framework Detailed Explanation and Integration Practice

This article introduces ReAct, a framework that uses logical reasoning and action sequences to achieve goal-oriented tasks through LLM decision-making and operations.

LangChain-22 Text Embedding and FAISS Practical Explanation

Text Embedding involves the process of mapping high-dimensional data (such as text, images, etc.) to lower-dimensional spaces.

LangChain-23 Vector AI Semantic Search System: Vector Databases and Retrieval

Vector Storage, also known as Vector Database, is a database system specifically optimized for storing and retrieving high-dimensional vector data.

LangChain-20 Document Loaders: TextLoader, CSVLoader, PyPDFLoader and More

This article introduces various document loaders provided by the LangChain library, such as TextLoader, CSVLoader, DirectoryLoader, etc., demonstrating how to load and pr...

LangChain Text Splitter: Character, Word, HTML and Code-based Splitting

This article introduces various TextSplitters in the LangChain library, including character-based, word-based, HTML tag-based, and programming language-based splitters...

LangChain Cache Mechanism: InMemoryCache and SQLiteCache Explained

LangChain provides a comprehensive caching mechanism to significantly reduce LLM call latency and costs. Its core includes InMemoryCache (in-memory cache) and SQLiteCache...

LangChain-19 TokenUsage Callback Function Explained

Explains how to integrate OpenAI GPT-3 model in Python through LangChain library, demonstrating how to use the getopenaicallback function to obtain callbacks and execute...

LangChain-16 Using Tools: Mastering LLM Tool Calling

LangChain is a powerful open-source framework designed to help developers more efficiently build and deploy applications based on Large Language Models (LLMs).

LangChain-17 Function Calling AI Function Calling Explained

Function Calling is a core technology for Large Language Models (like GPT-4, Claude, Gemini) to interact with external systems.

LangChain-14 OpenAI Content Moderation (Moderation) Explained

Moderation refers to the process of reviewing and managing user-generated content (UGC) through manual or automated means.

LangChain-15 Intelligent Knowledge Retrieval: AgentExecutor Practice

Build an intelligent knowledge retrieval system using Wikipedia search plugin, AgentExecutor, and LangChain tools. Covers agent initialization, tool binding...

LangChain-12 Routing By Semantic Similarity

This article introduces a method using large models (like OpenAI) and Prompt templates to handle unexpected inputs in program design by calculating the similarity between...