Together AI (@togethercompute) / X

Together AI

2,843 posts

Together AI

@togethercompute

Accelerate inference, model shaping, and pre-training on a research-optimized platform.

San Francisco, CA

Joined November 2022

Pinned
Together AI
@togethercompute
Jun 29
Open models are not just a pricing story. They are what happens when the AI stack becomes modular: models, APIs, harnesses, tools, and inference all improving independently. Together AI is building the inference layer for that shift.
Vipul Ved Prakash
@vipulved
Jun 29
Article
The Economy of Tokens
Carliss Baldwin and Kim Clark argued that the most important economic event in technology industries is often not the invention of a new product, but the creation of a modular architecture with stable...
85K
Together AI reposted
Zain
@ZainHasan6
55m
We gave a 2 hr deepdive on how to build inference engines that handle trillion token agentic workloads at @aiDotEngineer. Will drop slides and detailed walkthrough!
414
Together AI reposted
Victor Su-Ortiz
@VictorSuOrtiz
3h
Missed our talk with @togethercompute on @MiniMax_AI sparse attention and kernel optimizations? Catch us tomorrow at 10:45am PT.
1.7K
Together AI
@togethercompute
23h
As open models get stronger, more workloads move into the competitive inference market. That pushes the real fight toward speed, cost, reliability, and control. Together AI is where open models become production infrastructure.
Vipul Ved Prakash
@vipulved
Jun 29
Article
The Economy of Tokens
Carliss Baldwin and Kim Clark argued that the most important economic event in technology industries is often not the invention of a new product, but the creation of a modular architecture with stable...
3.2K
Together AI
@togethercompute
Jun 29
To celebrate the start of @aiDotEngineer AI Engineer World's Fair, we're launching a bracket competition! Use an agent or pick manually to choose your winners for the Round of 32 by 9:00 am PT Tuesday, June 30, for a chance to win: 1/ Lego trophy 2/ Mac Mini (for your 24/7
00:00
3.5K
Together AI
@togethercompute
Jun 29
Submit your winners here! aicup.io Thanks to @nutlope and @federicobianchy for bringing this to life 🙌
AI Cup 2026: Predict the World Cup, Win the Prizes
From aicup.io
1.3K
Together AI reposted
Vipul Ved Prakash
@vipulved
Jun 29
Article
The Economy of Tokens
Carliss Baldwin and Kim Clark argued that the most important economic event in technology industries is often not the invention of a new product, but the creation of a modular architecture with stable...
299K
Together AI
@togethercompute
Jun 28
More reason why we’re excited about GLM-5.2 on Together 👇 Strong enough for serious coding work, cheap enough to change routing decisions, and easy to access through the tools developers already use.
Harrison Kinsley
@Sentdex
Jun 28
Replying to @Sentdex
use GLM 5.2 via a USA provider that doesn't retain prompts if you care about privacy. The model is just as good as opus 4.8/gpt 5.5 It's the same speed as claude code, and goes down less often. On open router, its never since you can swap to diff provider if 1 drops. I'm
8.6K
Together AI
@togethercompute
Jun 28
There's a big difference between a single model call and serving an agent at scale. @ZainHasan6 breaks down what actually changes. Catch our team this Monday at 9 a.m. PST for their open-source inference workshop at @aiDotEngineer
00:00
3.6K
Together AI reposted
MiniMax (official)
@MiniMax_AI
Jun 27
next week at @aiDotEngineer, we are joining @togethercompute for a conversation on what goes into running agents at scale. @olive_jy_song, Research Lead, RL at MiniMax, and @realDanFu, VP of Kernels at Together AI, will walk through both sides of M3: the training decisions
20K
Together AI
@togethercompute
Jun 26
What happens when AI agents collaborate on open science? At @aiDotEngineer World’s Fair, @james_y_zou will share work on EinsteinArena and DSGym, from multi-agent math discovery to better evaluation for data science agents. Day 3, July 1. Expo Stage 3 SW.
3.8K
Together AI
@togethercompute
Jun 26
As token usage explodes, model choice becomes product strategy. Teams are already testing models like GLM-5.2 because they want frontier quality, better tokenomics, and more control over cost, data, and deployment. Together AI is building the inference layer for that open-model
Grace Isford
@graceisford
Jun 24
A huge moment for open source AI - as token usage skyrockets across orgs, so do concerns over cost & data/vendor lock-in & necessity of a multi-AI strategy Keep an eye on @Lux_Capital portfolio cos @huggingface @togethercompute @SakanaAILabs & more! 🚀
4.3K
Together AI reposted
Hassan
@nutlope
Jun 25
I love using GLM 5.2 for web app iteration. My workflow: generate 6 variations, then pick the best one and continue iterating on it. I built Recast to make this even easier. Give it a prompt, get 6 variations, download the code for your favorite version, then continue
00:00
20K
Together AI
@togethercompute
Jun 25
LLMs are getting better at writing GPU kernels. Multi-GPU kernels are the harder test. At @aiDotEngineer World's Fair, @simran_s_arora will share ParallelKernelBench, an open-source benchmark built from real CUDA communication problems where performance depends on moving data
2.9K