Technology
Following the progress of AI development is a bit like trying to tell the time by watching the second hand: such is the pace of progress that no one person can humanly track it for long. There exists, however, a notable lag between research breakthroughs and real-world applications, providing an opportunity to understand the broad technology landscape before these advances become ubiquitous. This section explores the current state of AI technology, its applications, challenges, and future directions.
Click on the section titles below to read more. Relevant links in the footnotes (‘References’), although NB some are behind paywalls.
Large Language Models (LLMs)
- Fundamental Capabilities: These models process and generate human-like text by training on vast datasets of written content. Their ability to understand context and generate coherent responses has revolutionised human-computer interaction, enabling everything from sophisticated chatbots to complex analysis tools. The technology represents a fundamental shift in how machines process and generate human language, moving from rule-based systems to more flexible, context-aware approaches1.
- GPT Models: Generative Pre-trained Transformers (GPT) represent a revolutionary approach to language model development:
- Architecture and Evolution:
- Transformer Technology: The architecture uses attention mechanisms to process text by understanding relationships between words in context. This capability enables these models to generate coherent, contextually appropriate responses across a wide range of tasks, fundamentally changing how machines process and generate human language2.
- Scaling Advances: Each generation of GPT models has demonstrated how increasing model size and training data can lead to emergent capabilities. From OpenAI’s GPT-3’s 175 billion parameters to more recent iterations, these scaling advances have revealed that for now larger models can develop abilities not explicitly programmed, including complex reasoning and task adaptation3.
- Capabilities and Applications: GPT models have demonstrated remarkable versatility:
- Large language models like GPT showcase remarkable versatility through their ability to adapt to diverse tasks without specific training and understand natural language instructions, enabling everything from writing to coding. Their capabilities have expanded to include multimodal processing, allowing them to work with various content types like text, code, images, and audio. This adaptability and multimodal integration represents a significant advancement in making these models more useful and accessible across different applications4.
- Architecture and Evolution:
Multimodal Models
- Integrated Processing: These systems represent an evolution in AI capability, simultaneously handling multiple types of data including text, images, audio, and video. This advancement mirrors human cognitive processes, where information from different senses is processed and integrated to form comprehensive understanding. The technology enables more natural and intuitive human-AI interaction by processing information in ways that more closely match human perception5.
- Advanced Applications: The ability to process multiple data types has opened new possibilities across various fields:
- Medical imaging systems that can discuss diagnoses while highlighting relevant visual features
- Educational platforms that combine visual, auditory, and textual learning materials
- Creative tools that can generate and modify content across different media types
- Security systems that integrate visual, audio, and behavioural analysis
Small Language Models (SLMs)
- Efficient Computing: These models represent a crucial evolution in AI system efficiency, designed to run effectively on local devices. Despite their reduced size, recent advances in model architecture and training techniques have allowed them to maintain impressive performance while dramatically reducing computational requirements6. This development is particularly significant for edge computing applications and resource-constrained environments.
- Edge Implementation: The advancement of SLMs has enabled local processing that enhances privacy and reduces latency by keeping sensitive data on-device (e.g. your mobile phone or Internet of Things (IoT) devices), enabling sophisticated AI capabilities without cloud dependence. This also allows deployment in areas with limited internet connectivity, democratising access to AI technology, and reduces energy consumption and carbon footprint compared to larger models7
Diffusion Models
- Visual Generation Process: These models employ an innovative approach that gradually removes noise from random data to generate high-quality images and videos. This process, which effectively reverses entropy, allows the model to learn and reproduce the underlying patterns that compose visual content. The technology has shown remarkable capability in understanding and generating complex visual information.
- Creative Applications: The impact on digital art and design has democratised access to sophisticated art creation through tools like Midjourney. These tools enable professional-grade image editing and manipulation capabilities via increasingly sophisticated video generation and modification tools. They are already being integrated into commercial design workflows, particularly in advertising and product visualisation8.
Agentic AI
- Agent Architecture: Agentic AI represents a significant evolution in artificial intelligence, where systems operate as autonomous agents capable of pursuing defined goals independently. These systems differ from traditional AI models by maintaining ongoing states, learning from interactions, and adapting their behaviour based on experience and feedback. The emergence of agentic AI marks a shift from passive, response-based systems to proactive, goal-oriented artificial intelligence9.
- Multi-Agent Systems: The development of agent swarms and collaborative AI networks is creating new possibilities:
- Collective Intelligence: Multiple AI agents can work together to solve complex problems, each specialising in different aspects of a task while coordinating their efforts. This approach mirrors natural systems where collective behaviour emerges from individual actions, often achieving results beyond the capabilities of single agents10.
- Adaptive Behaviour: Agent networks can dynamically reorganise and adjust their strategies based on changing conditions and requirements. This flexibility allows for more robust and resilient AI systems that can handle unexpected situations and evolving challenges11.
- Enterprise Applications: Agentic AI is transforming business operations:
- Workflow Automation: Companies like Microsoft and Salesforce are developing agentic platforms that can autonomously manage complex business processes. These systems can understand context, make decisions, and execute tasks with minimal human intervention, representing a significant advancement over traditional automation tools12 13.
- Collaborative Frameworks: Frameworks like OpenAI’s Swarm and AutoGen enable the creation of sophisticated agent networks that can work together on complex tasks. These systems demonstrate how multiple specialised agents can collaborate effectively, often achieving better results than single, more general-purpose AI systems14.
- Development Trends: The field is rapidly evolving with new capabilities:
- Reasoning Capabilities: Modern agents incorporate advanced reasoning mechanisms that allow them to break down complex tasks, plan sequences of actions, and adapt to new situations. This enhanced cognitive capability enables them to handle increasingly sophisticated assignments with greater autonomy15.
- Environmental Interaction: Agents are becoming more adept at interacting with their environment, whether digital or physical, through improved sensing, processing, and response mechanisms. This advancement is particularly important for applications in robotics and real-world problem solving.
Recent Developments: The field has seen remarkable advances in recent months:
-
- OpenAI’s ‘o3’ Model16: Although not yet publicly available at the time of writing (Jan 2025) and likely to be the full version of the earlier ‘o1 preview’ model which incorporated Chain of Thought processing, this further breakthrough represents a significant advancement in AI reasoning capabilities by a mainstream product. The model’s methodology breaks down complex problems into smaller, manageable steps, similar to human thinking patterns.
- Google’s Gemini 2.017: This model demonstrates improved reliability and context handling, with the ability to process and understand much longer sequences of information. This advancement brings us an AI system that can maintain coherent understanding across extended conversations and complex documents.
- Anthropic’s Claude 3.5 Sonnet (new)18: Although relatively old by now, the latest iteration shows enhanced reasoning capabilities and improved alignment with human values, and an API that can operate your PC mimicking the keyboard and mouse. Its Constitutional AI framework helps ensure outputs are both helpful and ethically sound.
Integrated Solutions
- Technology Convergence: Microsoft Copilot exemplifies how different AI technologies can be combined to create more powerful and versatile tools. This integration demonstrates the multiplicative effect of combining various AI capabilities, where the whole becomes greater than the sum of its parts19.
- Enhanced Capabilities: Integrated systems offer seamless transition between different types of tasks and data formats making them more intuitive. They generally offer improved problem-solving through multiple approaches and show greater adaptability to various use cases and requirements20
Applications and Capabilities
Advanced Robotics Integration
- Manufacturing Transformation: The integration of multi-modal LLMs with robotics has fundamentally changed manufacturing capabilities. With over 4 million robots now operating in factories worldwide, these systems demonstrate unprecedented adaptability in understanding and executing complex instructions. This convergence of AI and robotics has enabled more flexible and responsive manufacturing processes than previously possible21.
- Healthcare Applications: Medical robotics has achieved significant breakthroughs:
- Surgical robots that combine precision engineering with AI-driven decision support, reducing procedural risks and improving outcomes22
- Laboratory automation systems that accelerate research while maintaining exceptional accuracy23
- Patient care robots that provide consistent support while adapting to individual needs24
Simulation and Digital Twins
- Physics Modelling: AI’s capability to simulate real-world physics (aka ‘world models’) has reached unprecedented levels of sophistication. These advances are transforming multiple fields, from engineering to climate science, with experts predicting the emergence of a general-purpose robotics model within two years that could revolutionise how we develop and test physical systems25.
- Virtual Environments: Advanced simulation capabilities are reshaping industry practices. Digital twins provide real-time monitoring and predictive maintenance and virtual testing environments reduce development costs and accelerate innovation. Similarly, risk assessment simulations enhance safety planning and disaster preparedness, whilst training systems can be deployed that enable rapid iteration without physical risk26.
Cognitive Computing
- Enhanced Reasoning: Recent breakthroughs in Chain of Thought processing, exemplified by OpenAI’s ‘o1′ and ’03’ models, represent a fundamental advance in AI reasoning capabilities. This technology enables AI systems to break down complex problems into manageable steps and demonstrate their reasoning process, making decision-making more transparent and verifiable27.
- Decision Support: Advanced cognitive capabilities have expanded AI applications to handle complex problem decomposition that mirrors human analytical processes, and transparent reasoning chains that enable verification and adjustment. These capabilities can improve the AI system’s ability to handle uncertainty and ambiguity28.
System Integration
- Autonomous Operation: The development of sophisticated computer interaction capabilities, exemplified by Anthropic’s ‘computer use’ feature which can use a screen and mouse (as does Google’s ‘Jarvis’), marks a significant evolution in AI’s ability to operate independently within computer systems. This advancement represents a shift from AI as an advisory tool to an active system participant29.
- Process Automation: These capabilities enable new levels of automation such as end-to-end workflow management, that reduces human intervention, and intelligent process optimisation that improves efficiency. Other applications are error detection and correction, to improve reliability, and adaptive systems that respond to changing conditions30.
Cross-Industry Implementation
- Sector Transformation: AI capabilities are revolutionising traditional industries, from financial services leveraging AI for risk assessment and fraud detection, to educational institutions implementing personalised learning systems. Agricultural operations are using AI for precision farming techniques, whilst energy grids can optimise distribution through predictive analytics31.
- Emerging Applications: Innovation continues across sectors such as legal services automating document analysis and case research, creative industries developing AI-assisted content creation tools and urban planning implementing intelligent traffic management systems32.
Data and Training Constraints
- Resource Limitations: The finite nature of high-quality training data presents a fundamental challenge for AI development. The Welch Labs Efficient Compute Frontier research suggests a Moore’s Law-like relationship for data requirements, indicating that the demand for quality training materials is growing exponentially while suitable data sources remain limited. This constraint is particularly acute for specialised domains where high-quality data is scarce33.
- Quality and Bias Issues: Data challenges extend beyond quantity to complex quality concerns, such as historical biases in existing datasets that can perpetuate societal prejudices and difficulties in ensuring comprehensive representation across diverse populations. There are also privacy restrictions limiting access to valuable real-world data, and challenges in validating synthetic data generation methods34.
Computational Efficiency
- Energy Requirements: The increasing computational demands of AI systems present significant challenges as energy consumption in model training and deployment grows. In addition to system power, cooling systems are required for high-performance computing. This raises concern about the carbon footprint of AI operations and resource competition with other industrial sectors35.
- Infrastructure Solutions: However multiple approaches are being developed to address these challenges. Advanced inference chips from companies like Groq maximise processing efficiency and the implementation of 1-Bit LLMs reduce computational requirements. Reversible computing techniques are being developed to minimise energy loss and the integration of photonic chip technology enables faster, more efficient processing36.
System Architecture Challenges
- Scaling Issues: Current AI architectures face significant challenges in scaling as there’s an exponential increase in computational requirements with model size, in other words there are diminishing returns in performance improvements (and investment). There can be difficulties in maintaining model coherence at scale, and challenges in distributed training coordination37.
- Integration Complexities: Practical implementation faces numerous obstacles, for example compatibility issues with existing infrastructure and challenges in maintaining system reliability. Security vulnerabilities in distributed systems are also a concern as is performance variability across different deployments38.
Reliability and Consistency
- Stochastic Limitations: The probabilistic nature of current AI systems introduces inherent reliability challenges that generate inconsistent outputs from identical inputs under different conditions, which causes a difficulty in guaranteeing specific performance levels. Although improvements are being made, for example using meta-reasoning and ‘chain-of-thought’, there are still challenges in error detection and correction, and limited ability to provide deterministic results39.
- Information Integrity: AI systems can inadvertently create and propagate misinformation due to hallucination phenomena in language models that generate plausible but false content. Integrity can also be compromised via cumulative error amplification in iterative processes, and training data that has been contaminated by AI-generated content. Furthermore AI systems can struggle to distinguish authentic from synthetic information40.
Technical Paradoxes
- Moravec’s Paradox: This fundamental challenge highlights a crucial contradiction in AI development in that tasks requiring sophisticated human reasoning can be relatively simple for AI, yet basic human sensorimotor skills prove extremely challenging for AI systems. Intuitive human knowledge is difficult to replicate in machines and common sense reasoning still remains a significant challenge41
- Implementation Gap: The transition from laboratory to real-world applications also reveals significant challenges. There is a risk of performance degradation in uncontrolled environments and it can be difficult to maintain reliability across diverse scenarios. There are challenges both in adapting to unexpected situations, and integration issues with existing workflows and processes42
Environmental Impact
- Resource Consumption: AI development and deployment create significant environmental challenges, including increasing water usage for cooling systems, growing energy demands for data centres, electronic waste from hardware upgrades and raw material requirements for infrastructure expansion43.
- Sustainability Initiatives: The industry is developing various approaches to address environmental concerns, from the implementation of more efficient cooling technologies and the development of energy-efficient computing architectures, to the exploration and implementation of sustainable power sources. They are also adopting the approach of the circular economy to hardware lifecycle management44.
Evolution Toward AGI
- Development Framework: The path toward Artificial General Intelligence (AGI) is being mapped through structured frameworks such as OpenAI’s five-level progression model. While their latest models have reportedly reached the ‘reasoning’ level, there remains significant debate within the AI community about what constitutes genuine progress toward AGI and how it should be measured. This uncertainty reflects the complex nature of human intelligence and the challenges in replicating it45.
- Current State Assessment: The field could be considered to be in the ‘Advanced Cognitive AI’ stage, characterised by improved multi-domain competence and enhanced agentic capabilities. Recent models, alongside their established multi-modal functionality, also demonstrate sophisticated reasoning and adaptive learning capabilities46.
Assessment and Validation
- Evaluation Methodologies: Traditional benchmarking approaches are undergoing significant revision as their limitations become apparent. New initiatives like the ARC prize and ‘Humanity’s Last Exam’ are pioneering more comprehensive evaluation methods that better reflect real-world AI capabilities and potential risks. These new approaches focus on facets such as holistic performance assessment across multiple domains, verification of safety and ethical compliance, robustness under adverse conditions and long-term impact evaluation47.
Architectural Innovation
- Distributed Intelligence: A growing body of research suggests that AGI might emerge from the collaboration of billions of specialised AI agents rather than a monolithic system. This distributed approach could better mirror the modular nature of human cognition and provide inherent safety through distributed control. It could also enable more flexible and adaptable systems and facilitate easier updating and maintenance48.
- Novel Approaches: Different technical paths to advanced AI are also being explored, examples being Meta’s objective-driven I-JEPA architecture for more natural learning and Anthropic’s Constitutional AI frameworks to embed ethical constraints. There are also hybrid systems that combine symbolic and neural processing, and evolutionary approaches for adaptive learning capabilities49.
Safety and Control
- Enhanced Safeguards: Control mechanisms are evolving rapidly to address advanced AI systems’ challenges. Current approaches include multi-layered safety protocols with AI-based safety systems, real-time monitoring with automated mitigation actions, ethical boundary enforcement through specialised AI models, and explainable AI (XAI) for transparent decision auditing50.
- Training Protocols: Advanced safety measures encompass AI-driven data validation and synthetic data generation, automated bias and toxicity detection tools, value alignment through multi-agent systems, and comprehensive transparency frameworks with regulatory compliance51.
Research and Development
- Testing Environments: Following guidance from industry leaders like Demis Hassabis, sophisticated sandbox environments are being developed for safe AI testing. These environments enable controlled behaviour observation and analysis and risk assessment without real-world consequences. They also enable systematic capability evaluation and safety protocol validation52.
- Monitoring Systems: Advanced oversight focuses on behavioural pattern analysis post-deployment and emergency intervention capabilities. It also covers long-term stability assessment and interaction effect monitoring53.
Technological Convergence
- Quantum Integration: The synthesis of AI with quantum computing would present unprecedented opportunities. The technology would give us enhanced problem-solving capabilities and more sophisticated modelling and simulation. It would also reveal novel approaches to optimisation and potential breakthroughs in learning algorithms54.
- Cross-Domain Integration: Future developments are expected to include advanced neural interface technologies, sophisticated autonomous systems, enhanced environmental modelling and revolutionary computing architectures55.
Public Engagement
- Trust Development: Building public confidence in AI systems remains crucial for widespread adoption. The industry is being encouraged to implement transparent development processes and to communicate the capabilities and limitations of different systems. There are also moves for demonstrable safety protocols and regular public engagement initiative56.
- Societal Integration: Successful implementation requires a clear articulation of AI benefits and risks, and ethical deployment frameworks. There should be active stakeholder engagement and ongoing public dialogue and feedback57
Here are some of the developments in AI that highlight the rapid integration of the technology across various sectors, from consumer electronics to industrial applications, showcasing the technology’s increasing impact on everyday life and business operations:
- Humanoid Robots in Industry: Humanoid robotics are transforming industries with advanced dexterity and precision. Companies like Clone Robotics, Unitree Robotics, and Fourier Intelligence are developing robots with lifelike movements and AI-driven capabilities for applications in healthcare, manufacturing, and service industries58. I’ve written about embodied AI systems in a blog post here.
- AI in Consumer Electronics: The CES 2025 showcased a wide range of AI-equipped consumer products. LG and Samsung introduced smart TVs with built-in AI tools for improved picture quality and user interaction. These advancements demonstrate the increasing integration of AI into everyday consumer devices59
- Autonomous Driving Progress: While fully autonomous vehicles are not yet mainstream, significant advancements have been made in assisted driving systems. The industry is focusing on enhancing ADAS (Advanced Driver Assistance Systems) and HAD (Highly Automated Driving) features, with improved sensor integration for safer and more efficient driving experiences58
- AI in Customer Service: Generative AI is revolutionising customer service by producing dynamic, context-aware responses during interactions. This technology allows for personalised replies targeting specific customer needs in real-time, significantly improving the efficiency and quality of customer support58
- Agentic AI Applications: Agentic AI, which involves AI systems capable of autonomous decision-making and action, is gaining traction. Examples include self-driving cars and robot vacuums, which gather input from sensors and cameras to make decisions and perform tasks without human intervention58. I’ve written about screen-based agentic AI system, Manus, in a blog post here.
- AI in Business Intelligence: December 2024 saw significant advances in AI-powered business intelligence tools. Companies like Hyland, Databricks, and Liquid AI introduced enhanced platforms for improved information search, data analytics, and workflow optimization, demonstrating AI’s growing role in business decision-making processes58
- “Vibe coding”: Coined by computer scientist Andrej Karpathy, this AI-dependent approach allows software programmers to describe their intentions in natural language and let an artificial intelligence system generate most of the actual code60
- Follow the Money: on the basis that the activities of experienced investors provide clues as to the future direction of AI technology, US funder ‘Y Combinator’ Spring 2025 ‘Requests for Startups’ summary is informative61
References
A Short History of AI (Stanford University, 2016)
Attention Is All You Need (Wikipedia)
A brief history of LLM Scaling Laws and […] (Jonas Vetterle, Dec 24)
Your Ultimate Guide to Large Language Models (MarkovML, Feb 24)
Multimodal generative AI systems (Meta, Dec 23)
Google DeepMind Just Made Small Models […] (Analytics India, Nov 24)
Optimizing Edge AI […] (Giuliano Casale, Imperial College, May 24)
Everything You Need To Know About […] (Hyperstack, Nov 24)
Billions of Agents (David Shapiro, Sept 24)
How agentic AI platforms will redefine […] (SiliconANGLE, Sept 24)
Agentic Mesh: The Future of Gen […] (Towards Data Science, Nov 24)
Transform work with autonomous agents […] (Microsoft, Oct 24)
Salesforce ‘Agentforce’ (Salesforce, Aug 24)
OpenAI ‘Swarm’ (GitHub, Oct 24)
Agent-as-a-Judge: Evaluate Agents […] (Mingchen Zhuge et al, Oct 24)
OpenAI o3 (Wikipedia)
Introducing Gemini 2.0: our new AI model for […] (Google, Dec 24)
Introducing computer use, a new Claude 3.5 Sonnet […] (Anthropic, Oct 24)
Empower your organization with Copilot (Microsoft)
AI in data integration: Types, challenges, […] (Leeway Hertz, Nov 24)
Record of 4 Million Robots in […] (International Federation of Robotics, Sept 24)
Artificial intelligence in surgery: use cases […] (Vention, Jul 24)
The impact of laboratory automation on […] (Loulwah Ahmed Alhammad et al, Dec 23)
Implementing assistive robots in real-life […] (PAL robotics, Dec 23)
Jim Fan on Nvidia’s Embodied AI Lab and […] (Nvidia, Sept 24)
Deliberative alignment: reasoning enables […] (OpenAI, Dec 24)
Unpacking chain-of-thought prompting: a new […] (Toloka, Oct 24)
Developing a computer use model (Anthropic, Oct 24)
10 Effective Business Process Automation Examples […] (Motion, May 24)
2024 AI Index Report (Stanford, May 24)
2025: Navigating the Future of Innovation (Speedinvest, Jan 25)
The efficient compute frontier (Welch Labs, Aug 24)
Data Quality in AI: Challenges, Importance & […] (Altay Ataman, Nov 24)
Prioritize environmental sustainability in use of AI […] (Nature, Jan 24)
Photonic processor could enable ultrafast AI computations […] (MIT News, Dec 24)
A brief history of LLM Scaling Laws and what to expect […] (Jonas Vetterle, Dec 24)
Integration Challenges of AI in Businesses, […] (AI Boost Marketing, Jan 24)
Improve AI Thinking with Meta-Reasoning Techniques (Relevance AI)
Australian lawyer reported to regulator for filing fake […] (Legal IT Insider, Oct 24)
Moravec’s paradox (Wikipedia)
AI companies are pivoting from creating gods to building […] (AI Snake Oil, Aug 24)
Enterprises in for a shock when they realize power and […] (The Register, Jan 25)
Green AI: Building Sustainability into AI Initiatives (Technology Magazine, Sept 24)
OpenAI’s 5 Levels Of ‘Super AI’ […] (Forbes, Jul 24)
AI in 2025: Five Defining Themes (SAP News, Jan 25)
Humanity’s Last Exam (Centre for AI Safety)
Overview: Multi-Agent AI Frameworks (Bizety, Jan 25)
I-JEPA: The first AI model based on Yann LeCun’s […] (Meta, June 23)
The AI Safety Institute […] (Center for Strategic and International Studies, Oct 24)
The Imperative of AI Safety in 2025: […] (Hyper Policy, Dec 24)
Introducing the Frontier Safety Framework (Google DeepMind, May 24)
EUREKA: Evaluating and Understanding Large Foundation (Microsoft, Sept 24)
The Future Of AI: Unleashing The Power Of Quantum […] (Forbes, Jul 24)
Deloitte Global’s 2025 Predictions Report: Generative AI […] (Deloitte, Nov 24)
UK launches AI safety programme to build public […] (Open Access Government, Oct 24)
5 examples of responsible technology: a new […] (World Economic Forum, Jan 24)
Humanoid robotics set to transform […] (Control Drives & Automation, Jan 25)
AI Tools Drive Many New Products at the CES 2025 (VOA, Jan 25)
Vibe Coding’s Rise in Software Development (Perplexity ‘Discover’, Mar 25)
Requests for Startups (Y Combinator, Jan 25)