Tag: Artificial Intelligence
86 articles
AI Research #135: Gemini 3 Pro Back on Top - MoE, Million-token Context and Deep Think
Explains Gemini 3 Pro's advantages through sparse MoE architecture, million-token context, native multimodal (text/image/video/PDF), thinking depth control (thinking_leve...
AI Research #130: Qwen2.5-Omni Practical Applications
Office assistant, education and training, programming and operations, search-enhanced RAG, device control/plugin agents, and companion entertainment.
AI Research #129: Qwen2.5-Omni-7B Key Specs - VRAM, Context and Deployment
Runs stably at FP16 ~14GB VRAM, with INT8/INT4 quantization (<4GB) enabling deployment on consumer GPUs or edge devices.
AI Research #128: Qwen2.5-Omni Training Pipeline - Three-stage Multi-modal Training
Complete training pipeline breakdown for Qwen2.5-Omni: Thinker based on Qwen2.5, vision initialized from Qwen2.5-VL, audio from Whisper-large-v3.
AI Research #127: Qwen2.5-Omni Deep Dive - Thinker-Talker Dual-core Architecture
Engineering breakdown of Qwen2.5-Omni (2024-2025) Thinker-Talker dual-core architecture: unified Transformer decoder for text/image/video/audio fusion, TMRoPE.
AI Research #125: Tesla FSD Business Model and Competitive Landscape
As of end 2022, Tesla had ~$2.9 billion in FSD-related deferred revenue Q4 2022 recognized $324 million in FSD revenue
AI Research #124: Tesla FSD V14 Deep Analysis
Tesla FSD V14 real-world performance and road tests, comparing V13.2 on urban roads and highways: key disengagement metrics, lane changes/ramps, destination arrival...
AI Research #123: FSD V14 Deep Analysis - Vision-Only SDF vs V12
3D environment reconstruction Precision: 10cm (3× improvement over V12's ~33cm resolution) Multi-frame spatiotemporal fusion for dynamic object tracking
AI Research #121: DeepSeek-OCR Research Directions
Frontier approaches and engineering implementation for DeepSeek-OCR (2025, including 3B parameter direction).
AI Research #119: DeepSeek-OCR PyTorch FlashAttn 2.7.3 Inference and Deployment
Comprehensive guide for DeepSeek-OCR local/private deployment based on Python 3.12, PyTorch 2.6.0, Transformers 4.46.3 and FlashAttention 2.7.3.
AI Research #120: DeepSeek-OCR from 0 to 1 - Getting Started and Engineering Essentials
Complete getting started path and engineering essentials for DeepSeek-OCR (as of 2025), covering environment setup (Python/PyTorch 2.x, Transformers 4.
AI Investigation #107: RL and Robot Training Data Format Analysis
Constructed in state-action-reward sequence form, supporting spatiotemporal understanding of models like Transformers.
AI Investigation #106: Robot Learning Data Collection Tools and Methods - Sensors, APIs, Teleoperation and Simulation
Core data collection methods and application scenarios, covering over ten methods from manual entry, sensor collection, web crawlers, API calls, log collection.
AI Investigation #105: Robot Learning Data Collection - From Demonstration Videos to State-Action Pairs
Data collection is a critical step in robot learning development, covering demonstration video collection, trajectory recording, state-action pair generation...
AI Investigation #103: Embodied AI Technology Landscape
Comprehensive overview of embodied AI tech stack: hardware (GPU, sensors, actuators), software (ROS, simulation), and algorithms (deep learning, RL, VLA models).
AI Investigation #102: Intelligent Robotic Arms, Autonomous Driving and Humanoid Robots - Imitation Learning, Reinforcement Learning and Multimodal Fusion Trends
Different types of robots have huge differences in structure, tasks and control methods, so AI algorithm adaptation strategies also need to be tailored.
AI Investigation #101: Modern AI Methods - VLA, RT-1, RT-2 and Diffusion Models for Robot Control
Modern AI robot control methods are undergoing a major transition from reinforcement learning and imitation learning to multimodal agents driven by large models.
AI Investigation #99: Sensor Fusion Technology - Camera, LiDAR, IMU and Radar Fusion
Sensor Fusion is a core technology in autonomous driving, robotics and smart security.
AI Investigation #98: Visual SLAM - ORB-SLAM, RTAB-Map and VINS-Fusion
Visual SLAM is a technology that achieves autonomous positioning and environment mapping without relying on LiDAR, using only cameras.
AI Investigation #96: Robot Scenario Testing - From Extreme Environments to Real-time Simulation
Complete guide to robot scenario testing, covering three dimensions: environment testing, load testing, and anomaly testing.
AI Investigation #95: Robot Scenario Testing - From Extreme Environment Simulation to Automated Fault Injection
Camera Instant Frame Loss: 5-100ms frame drop LiDAR Noise Surge: Random noise 5-20% IMU Data Jump: 1-3x normal values
AI Investigation #93: Robot Simulation Tools - Comprehensive Comparison from Gazebo to Isaac Sim
Simulation tools are an important part of robot R&D, enabling algorithm verification and system debugging in risk-free environments, accelerating iteration.
AI Investigation #92: Robot Motion Control - From Traditional Models to Deep Learning Methods
Robot motion control can be divided into two categories: traditional model-based methods and deep learning-based intelligent control.
AI Investigation #91: Multi-modal Data Annotation Tools - From Label Studio to 3D Point Cloud Labeling
In robot vision and perception model training, high-quality multi-modal data annotation tools are crucial.
AI Investigation #89: Multi-dimensional Autonomic Nervous System Assessment Beyond HRV
Individual differences in autonomic nervous system (ANS) profoundly affect disease susceptibility.
AI Investigation #88: HRV and the Autonomic Nervous System - Monitoring, Health Meaning and Improvement
Higher HRV: Stronger vagal tone, more relaxed Lower HRV: Chronic stress, fatigue, cardiovascular disease
AI Investigation #87: HRV Time Domain and Frequency Domain Metrics: SDNN, RMSSD, pNN50 and Spectral Analysis
HRV calculation methods mainly include time domain and frequency domain.
AI Investigation #85: Autonomic Nervous System Assessment - HRV, BPV, EDA and Pupillary Reflex
Measurement technology: ECG, PPG Analysis: Time domain, frequency domain, nonlinear
AI Investigation #84: Fat Loss Science - A Practical Guide to Maintaining Low Body Fat
Maintaining low body fat is more challenging than losing it. The human body is naturally inclined to store energy.
AI Investigation #83: Fat Loss Science - Complete Diet Guide
During fat loss, intake is recommended at 1900-2100 kcal, with daily deficit of 300-500 kcal.
AI Investigation #81: Fat Loss Science - What Body Fat Percentage Is Healthy?
Body fat percentage affects both appearance and health directly. Healthy body fat for men is 10%~20%, ideal around 15%; for women it's between 18%~28%.
AI Investigation #80: Why Fat Loss Must Be Science-Based - Energy Balance and Metabolism Explained
The core mechanism of fat loss is energy deficit: when calorie intake is less than consumption, the body uses fat stores for energy.
AI Investigation #78: LFP Lithium Battery - Shallow Charging vs Deep Cycling and Their Impact on Battery Life
Shallow charge frequent and deep discharge charge have significantly different impacts on LFP battery life.
AI Investigation #76: When Robots Enter Life - Embodied AI's Deep Impact on Employment and Social Structure
The widespread application of embodied AI is profoundly changing social structure.
AI Investigation #75: From LLM to LBM - Hierarchical Robot Control Architecture Driven by Large Models
The integration of Large Language Models (LLM) with robot real-time control is driving intelligent upgrades in robotics.
AI Investigation #74: Robot Learning Breakthroughs - Meta-Learning and Sim-to-Real Transfer
This article explores fast learning capabilities of embodied AI agents, focusing on meta-learning and few-shot learning methods.
AI Investigation #73: Embodied AI Future Trends - From Technology Integration to Industrial Deployment
In the next decade, embodied AI will undergo paradigm shifts: centered on 'pre-trained world models + online learning', software-hardware collaboration and interdisciplin...
AI Investigation #72: Embodied AI Development Challenges - Data, Hardware, Compute and Commercialization
Embodied AI development faces six core challenges: data scarcity, hardware limitations, training efficiency, cost bottlenecks, standardization and industrial ecosystem...
AI Investigation #71: Embodied AI Case Studies - From ROS to Tesla Optimus in Open Source and Commercial Practice
Typical practices of embodied AI in architecture, capabilities and applications.
AI Investigation #70: Embodied AI Industry Ecology and Development Trends
Embodied AI is leading a new round of technological revolution. The market size is expected to grow from $2.53 billion in 2024 to $8.76 billion in 2033, with a CAGR of 15...
AI Investigation #59: Robotics Career Map - Development Paths, Skill Requirements and Salary Insights
The robotics industry is experiencing rapid development, with surging demand for interdisciplinary talent with mechanical, electronic, control and software capabilities.
AI Investigation #56: Robotics Technology Iteration - From Hydraulic Drive to AI Collaboration
Robotics has undergone profound evolution from early hydraulic drive and analog control to modern electric drive, digital control and perception systems.
AI Investigation #55: Robotics - A Century of Evolution from Unimate to Humanoid Agents
Since the term 'Robot' was first introduced in 1921, robotics has undergone a century of evolution from science fiction to reality.
AI Investigation #54: Big Data Industry Applications and Technology Selection Trends
Big data has achieved deep integration in finance, e-commerce, internet, communications, manufacturing, healthcare, education and other industries, becoming the core engi...
AI Investigation #53: Big Data Talent Landscape - Experience Distribution, Growth Paths and Industry Trends
The talent structure in the big data industry shows characteristics of youth and rapid growth. The 25-30 age group is the main force, while 30-35 year-olds are gradually...
AI Investigation #52: Big Data Technology Landscape - Lakehouse, Data Mesh and Serverless
Big data technology is undergoing a new wave of transformation. Lakehouse architecture combines the advantages of data lakes and data warehouses.
AI Investigation #51: Big Data Technology Evolution - Obsolete Frameworks, Architectures and the Reasons Behind Them
Big data technology evolution: MapReduce replaced by Spark, Storm replaced by Flink, Pig/Hive gradually phased out.
AI Research 49 - Big Data Survey Report: Development History from 1997 to 2025
Big data development began in 1997 when NASA proposed the concept, 2003-2006 Google published GFS, MapReduce, Bigtable three major papers leading distributed computing re...
AI Research 01 - Is Mindfulness Meditation Effective? Health Benefits and Scientific Guide
Mindfulness meditation is a scientifically validated mind-body regulation method.
AI Research 48 - Traditional Chinese and Western Medicine Research: Complementary Clinical Practice
Compare differences and advantages of Chinese and Western medicine in treating common diseases and chronic diseases.
AI Research 47 - Multi-Dimensional Survey Report: A Systematic Strategy for Nutritional Supplementation
Extra nutrient supplementation should be based on individual assessment, not blind following.
AI Research 46 - Multi-Dimensional Survey Report: Multivitamins Are Not a Cure-All for Chronic Disease Prevention
Multiple large-scale clinical studies show that for ordinary healthy people, long-term intake of multivitamin supplements has limited overall effect in preventing cardiov...
AI Research 43 - Multi-Dimensional Survey Report: Should Young People and Seniors Take Multivitamins?
Multivitamin supplements can fill nutritional gaps caused by unbalanced diet, suitable for elderly, vegetarians, pregnant women and other specific groups.
AI Research 41 - Multimodal Large Model Quantization: Qwen2.5-VL Architecture, Capability Evaluation and Use Cases
Qwen2.5-VL is the new generation multimodal large model launched by Alibaba, significantly leading in visual understanding, video analysis, and cross-modal reasoning.
AI Research 38 - Multimodal Large Model Quantization: Evaluation Strategies for Mainstream Vision-Language Tasks
To systematically evaluate the impact of model quantization on performance, need to combine multiple vision-language datasets and metrics.
AI Research 37 - Multimodal Large Model Quantization: Impact on Vision, Language and Multimodal Tasks
Model quantization compresses FP32 weights into low-precision representations, significantly reducing inference resource consumption.
AI Research 35 - Coffee Price War: Taste and Experience Differences, Why Does Homemade Coffee Taste Better?
The taste experience differences between homemade coffee and chain budget coffee stem from multiple factors including ingredient quality, production process...
AI Research 34 - Coffee Price War: User Preferences and Business Strategies Behind the Beverage Trend
China's chain coffee market is experiencing beverage-ization change—coffee products increasingly becoming milk-tea-like.
AI Research 31 - Programmers Don't Have Bad Memory, They Just Use the Wrong Methods
8 evidence-based strategies for programmers to combat forgetting: Ebbinghaus forgetting curve can be quantified.
AI Research 30 - How Programmers Can Scientifically Combat Forgetting
Programmers often experience 'forgetting after learning'. Effective strategies include: note organization, spaced repetition (Anki), active recall, project practice...
AI Research 29 - Why Does the "Perfect Partner" During Dating Become a "Problem Partner" After Marriage?
Dating focuses on emotional attraction, marriage values responsibility, communication, and values more.
AI Research 28 - A Panorama of Dating Perspectives in China and the US
Dating perspective differences between China and US: first love age is approximately 17-18 in China, 15-17 in US. Number of relationships before marriage:
AI Research 27 - [Time Management] Heavy vs Light Users: How to Reallocate Time?
Heavy social media users have higher rates of depression, anxiety, and loneliness, plus more sleep problems.
AI Research 26 - [Time Management] Social Media Usage Time Across Different Populations
Global netizens spend an average of 2 hours and 21 minutes on social media daily. Chinese users spend 1.55 hours on Douyin, with short video usage peaking at midnight.
AI Research 25 - [Money Management] Savings vs Spending: Should I Save More or Spend More?
Should young people save more or spend more? Experts recommend 'save first, then spend', following the 50/30/20 rule: 50% needs, 30% wants, 20% savings.
AI Research 24 - [Money Management] Survey of Young People's Spending and Saving Across Countries
Comparison of savings levels, consumption structure, and financial concepts among young people in China, the US, and Japan.
AI Research 23 - [Time Management] Strategies and Metrics for Evaluating Time Management
Time management can significantly improve efficiency and sense of achievement.
AI Research 22 - [Time Management] Reasonable Planning and Management Plan
More than 80% of people do not have a formal time management system.
AI Research 21 - [Time Management] Necessity and Practical Improvement Plan
Effective time management can improve efficiency, reduce stress, and improve work-life balance. Scientific evidence includes psychology, behavioral science...
AI Research 20 - [Mindfulness Meditation] Health Benefits and Scientific Guide
Mindfulness meditation is a scientifically validated mind-body practice.
AI Research 19 - Conflict Analysis Between Programmers and Product Managers: Practical Advice
Reducing conflicts between programmers and product managers requires: improving requirements process, enhancing communication skills, aligning goals, agile practices...
AI Research 18 - Conflict Analysis Between Programmers and Product Managers: Conflict Avoidance and Resolution
Resolving conflicts between programmers and product managers requires: establishing clear communication mechanisms, improving document quality, fairly negotiating priorit...
AI Research 17 - Conflict Analysis Between Programmers and Product Managers: Patterns and Influencing Factors
Conflicts between programmers and product managers show a 'bell curve' distribution. Influencing factors include project scale, team experience, company type, work mode...
AI Research 16 - Conflict Analysis Between Programmers and Product Managers
Conflicts between programmers and product managers most frequently occur in four key stages: requirements analysis, development implementation, schedule planning...
AI Research 13 - LLM and Agent Research: The Rise and Development of LLM Agents
2024 is called the 'Year of Agents'. LLM trends show parallel development of 'bigger and stronger' and 'smaller and more specialized'.
AI Research 15 - Methodology, Validation Process, and Direction Verification for Building Products from 0 to 1
Building products from 0 to 1 requires mastering methodologies like Lean Startup and Design Thinking, and validating directions through requirement verification...
AI Research 14 - How to Write Excellent Research Reports, Slide Decks and Technical Sharing
How to write excellent research reports, slide presentations, and technical sharing sessions?
AI Research 12 - LLM and Agent Research: Overview of Major LLM Application Directions
Major LLM application directions in 2024-2025 include enterprise applications (code assistance, customer service, knowledge management) and consumer applications.
AI Research 11 - Running Analysis Research: Nutrition Strategies, Timing and Planning for Endurance and Body Shaping
Running nutrition strategies should focus on carbohydrates (50-65% of daily energy), protein (1.2-1.6g/kg), and fat (20-30%). Consume 15-25g of protein within 30 minutes...
AI Research 09 - Running Analysis and Research: Optimal Running Methods, Post-Run Nutrition and Body Recomposition
The optimal way to run is to keep heart rate between 60% and 80% of maximum heart rate. Post-run nutrition should include carbohydrates and protein promptly.
AI Research 07 - Running Benefits: Effects of 3K, 5K, 10K, Half Marathon and Marathon
Running has significant benefits for cardiovascular health, weight management, and psychological state.
AI Research 06 - The Physiological Health Effects and Mechanisms of Cold Showers
As a health intervention, cold showers can provide benefits such as refreshing, immune enhancement, and recovery promotion when practiced scientifically.
AI Research 05 - The Physiological Health Effects and Mechanisms of Cold Showers: Cold vs Hot Showers
Cold showers have positive effects on mental health, including improved alertness, stress and depression relief, and enhanced psychological resilience.
AI Research 04 - The Physiological Health Effects and Mechanisms of Cold Showers Part 1
Cold showers typically refer to showering with water temperature at or below 20°C, with positive effects on immune function, blood circulation, metabolic rate...
AI Research 03 - Does Technical Time Investment Correlate with Salary Returns?
Reasons for stagnant income despite accumulated technical capabilities include position saturation, increased talent supply.
AI Research 02 - Does Technical Time Investment Correlate with Salary Growth? Part 1
Technical investment and income are generally positively correlated, but returns do not grow infinitely.