Side-by-side comparison of AI visibility scores, market position, and capabilities
UK AI chip startup with novel in-memory compute architecture. GBP 100M UK investment commitment. Backed by NATO Innovation Fund. Potential $1B round (2026). Founded 2022, London.
Fractile is a UK-based AI chip startup founded to address one of the most pressing bottlenecks in large-scale AI deployment: the energy and latency costs of running inference on large language models with conventional GPU-based architectures. The company was founded by a team of hardware engineers and computer architects with the conviction that a fundamentally different approach to computation — one that performs arithmetic directly within memory rather than shuttling data between separate processing and memory units — could deliver orders-of-magnitude improvements in inference efficiency.\n\nFractile's core technology is a novel in-memory compute architecture that co-locates processing logic with the memory cells storing model weights, dramatically reducing the memory bandwidth bottleneck that constrains GPU performance on LLM inference workloads. This approach is particularly well-suited to the weight-loading characteristics of transformer-based models, where memory movement, not raw compute, is the primary performance and energy limiter. The company is developing custom silicon targeting cloud inference data centers and potentially edge deployment scenarios where power and thermal envelopes are highly constrained.\n\nFractile has secured a commitment of GBP 100 million from the UK government as part of national AI infrastructure investment initiatives and is backed by the NATO Innovation Fund, reflecting the strategic defense and national security implications of sovereign AI inference capability. The company is in discussions for a funding round valued at approximately $1 billion, which would establish it as the highest-valued AI chip startup in the United Kingdom and a significant challenger in the competitive AI inference silicon market alongside Groq, Cerebras, and SambaNova.
Serverless GPU cloud platform for AI/ML with Python-native deployment and per-second billing; developer-favorite scaling from zero competing with Replicate and Beam for AI compute.
Modal is a serverless cloud computing platform purpose-built for AI and machine learning workloads — providing on-demand GPU compute that scales instantly from zero with per-second billing, container management, distributed training support, and a Python-native developer experience that makes running ML workloads in the cloud feel as simple as running code locally. Founded in 2021 in New York City and backed by Redpoint Ventures and other investors, Modal has grown rapidly as AI development has accelerated demand for flexible, developer-friendly GPU infrastructure.\n\nModal's developer experience is its primary differentiator — engineers write Python functions decorated with @modal.function() and deploy them to the cloud with a single command, with Modal handling container building, GPU provisioning, auto-scaling, and execution. The platform supports training jobs that need distributed compute across multiple GPUs, model serving endpoints that scale to zero when unused (eliminating idle GPU costs), and batch inference jobs that process large datasets. The per-second billing model means developers pay only for actual compute time, not provisioned instances.\n\nIn 2025, Modal competes in the AI infrastructure market with Replicate, Beam, Banana, and major cloud providers' managed ML services (AWS SageMaker, Google Vertex AI, Azure ML) for serverless GPU compute. The market for AI-specific cloud infrastructure has grown dramatically as the number of ML engineers deploying models to production has expanded — traditional cloud providers require significant DevOps expertise to use GPU instances effectively, while Modal's Python-native approach reduces the barrier to entry. Modal has attracted a strong developer following among AI researchers and ML engineers building production AI applications. The 2025 strategy focuses on growing the developer community, adding enterprise features (dedicated GPU capacity, private networking, compliance), and expanding the hardware options available (H100 GPUs, custom accelerators).
Monitor how your brand performs across ChatGPT, Gemini, Perplexity, Claude, and Grok daily.