xAI Interview with Sulaiman Ghori
Comprehensive Analysis: xAI Interview with Sulaiman Ghori
A deep-dive analysis of the Relentless podcast interview covering xAI’s culture, technology, and operational philosophy.
Executive Summary
This interview with Sulaiman Ghori (“Sully”), an engineer at xAI, provides an unprecedented inside look at how one of the fastest-growing AI companies operates. The key themes that emerge are:
- Extreme velocity - “Yesterday” is the deadline, not a future date
- Hardware as moat - Physical infrastructure is their primary competitive advantage
- Small model strategy - Counter to industry trends, prioritizing speed over size
- Flat organization - Only 3 management layers enable rapid decision-making
- Tesla synergy - Leveraging 4M+ Tesla vehicles as distributed compute
- First principles - Aggressively eliminating “perceived limitations”
Part 1: Company Culture and Values
Speed as the Core Value
The dominant cultural theme at xAI is extreme velocity. The company operates with a philosophy that conventional timelines are artificial constraints to be eliminated.
“We don’t have really due dates. It’s always yesterday.”
“There’s no blockers for anything—at least nothing artificial.”
This manifests in practical ways:
- Colossus data center built in 122 days
- Model iterations happening daily, sometimes multiple times per day
- Hardware can be training within hours of rack setup
First Principles Thinking
The company embraces Elon’s philosophy of reducing problems to their physical or fundamental constraints:
“The whole Elon thing about going down to the root, the fundamental, whatever the physical thing is, we get there pretty quick if we can, as quick as we can.”
The “No Bureaucracy” Philosophy
Ideas can go from concept to production the same day:
“No one tells me no. If I have a good idea I can usually go and implement it that same day and show it to Elon or whoever and we got an answer.”
Decision-making is rapid and binary:
“Usually you propose something and the answer is either ’no that’s dumb’ or ‘why isn’t it done already?’”
Everyone is an Engineer
There’s a strong bias toward technical capability across all roles:
“The sales team is engineers. Everyone is an engineer. I think at the time there was probably less than 8 people who were not engineers at the company.”
Part 2: Organizational Structure
Flat Hierarchy - Only Three Layers
The organization is remarkably flat:
“There’s basically only three layers of management. There’s ICs, there’s the co-founders and some of the new managers, and then Elon and that’s it.”
Fuzzy Team Boundaries
Teams are intentionally fluid with minimal rigid structure:
“The fuzziness between teams and what everyone is responsible for is definitely not what I expected and I don’t think exists nearly as much in any large company.”
“If I need to fix something on our VM infrastructure, I will do it. I will show it to the guy who owns that and they will be like okay and it’s merged immediately and deployed.”
Organic Ownership Through Contribution
“With Elon companies, you can kind of just ask for responsibility and then you basically just live by the sword, die by the sword. If you get things done, you can ask for more responsibility and keep doing that—or you’re just out.”
Part 3: Technical Approach and Speed
The 2-8x Improvement Principle
“There is a lot of perceived limitations especially in the software world coming from the last 10 years of web dev. People just assume or accept certain limitations especially when it comes to speed and latency—and they’re not true.”
“You can get rid of a lot of overhead. Like there’s a lot of stupid stuff in the stack and if you can knock out a lot of that, you can usually 2x to 8x most anything at least anything invented relatively recently.”
Daily Model Iterations
“We’re coming out with new iterations daily, sometimes multiple times a day, which is from pre-train in some cases—which is not something you ordinarily really see.”
The 24-Hour Iteration Cycle
“We had this 24-hour iteration cycle. We would get feedback every night on whatever we were doing. We would push out that night. In the morning we would have all the feedback. We would immediately knock out all the bugs, implement the new stuff people were asking for.”
The “Delete Then Re-add” Methodology
Following the SpaceX algorithm:
“We do the thing where we delete something and then add it back later.”
When asked for the last time this happened: “Today.”
Part 4: Macro Hard - The Human Emulator
Core Concept
Macro Hard is xAI’s project to do for digital work what Optimus does for physical work:
“With Optimus, you’re taking any physical task a human can do and allowing a robot to do it automatically at a fraction of the cost with 24/7 uptime. We’re doing the same with anything that a human does digitally.”
The key differentiator: No software adoption required. They emulate keyboard/mouse inputs and screen reading directly.
Speed Over Size Strategy
Counter to industry trends of building larger models, xAI prioritizes speed:
“The decision to go with a model that would be at least 1.5x faster than a human for Macro Hard—looking like significantly faster than that. 8x maybe, maybe more.”
“For other human emulator attempts at other labs, the approach has been: let’s do more reasoning and build a bigger model. That decision put us on totally the opposite track of what everyone else is doing.”
The rationale is practical:
“No one’s going to wait around 10 minutes for the computer to do something they could have done in five, but if it can be done in 10 seconds? They’d be happy to pay whatever amount of money for that.”
The Tesla Computer Strategy
One of the most innovative aspects revealed:
“If we want to deploy 1 million human emulators we need 1 million computers. How do we do that?”
“There’s like 4 million Tesla cars in North America alone. Let’s say half of them have hardware 4. Somewhere between 70-80% of the time they’re sitting there idle, probably charging. They have networking, they have cooling, they have power. We can just pay owners to lease time off their car and let us run a human emulator on it.”
No buildout required—purely a software implementation.
Internal Testing with Virtual Employees
xAI has been testing human emulators internally, sometimes without telling employees:
“In some cases there’ll be someone doing some work and someone is like, ‘Hey, can you help me with this thing?’ And the virtual employee is like, ‘Yeah, sure. Come to my desk.’ And they go there and there’s nothing there.”
“Multiple times I’ve gotten a ping saying: ‘Hey, this guy on the org chart reports to you. Is he not in today or something?’ It’s an AI. It’s a virtual employee.”
Generalization Capabilities
“Just today we gave Elon a few cases where we did not train on this task at all, but it did it flawlessly—way better than we would have expected.”
Part 5: Infrastructure and Hardware
Hardware as Competitive Advantage
“It’s probably our biggest edge—the hardware—because nobody else is even close on the deployment there.”
“It’s kind of hardware at this point. It’s like hardware constrained.”
The Colossus Build (122 Days)
This speed enabled long-term planning:
“The speed allows us to think more long-term. Grock 4 or 5 was already planned out and designed in terms of size way early, before I joined.”
The “Carnival Permit” Hack
“The lease for the land itself was actually technically temporary. It was the fastest way to get the permitting through.”
“There’s basically a special exception within local and state government that says if you want to modify this ground temporarily—I think it’s for carnivals and stuff.”
“xAI is actually just a carnival company currently.”
Power Management
Managing megawatt-scale power fluctuations:
“We have to collaborate very tightly with municipal and state power companies. When load goes high on their end, we have to shut off and go fully on 80+ mobile generators we brought in on trucks—seamlessly, without interrupting anyone’s extremely volatile training runs on hardware which scales up and down by megawatts in milliseconds.”
Multi-layer power architecture: Local capacitors → Data hall capacitors → Batteries → Generators → Municipal power
Part 6: Elon’s Management Style
The “Fire to Fire” Approach
“One of the things Elon does best is he basically just goes from fire to fire on whatever the company is and just puts it out and unfucks whatever problem exists.”
Direct Blocker Removal
“He would hear these [issues] and he would make a phone call and the software team would deliver a patch the next day. We would work side by side until that was resolved—where otherwise it would have taken weeks of back and forth.”
Bimodal Feedback (High-Level or Low-Level)
“Usually it’s either at a very high level or at a very low level. It’s not really often in between.”
- High level: Product direction, segment focus, strategic decisions
- Low level: Compute efficiency, latency, specific technical suggestions
Proof Over Opinion
“He’s open to being proven wrong, but it has to be proof. It has to be: let’s try it and see what the results are. It can’t be just someone’s opinion. There has to be an experiment done.”
Timeline Philosophy
“He always says: you can always attempt to do something in one month that would otherwise take a year and you’ll probably get it done maybe in two. Still a lot faster.”
Part 7: Hiring Philosophy
Engineers, Not Researchers
“Engineers. Just engineers. Doesn’t matter. Good engineers. Just someone who’s fundamentally a problem solver. Doesn’t matter if they did this XYZ thing in this infrastructure or this particular architecture. Engineers.”
Finding Simple Solutions
Sulaiman uses a specific interview technique:
“I have a very specific computer vision problem I solved. I give people half an hour to implement the solution. It’s actually very simple. A deceptively simple solution. People always overthink it. I index for: can you not overthink it and come up with a simple solution?”
“An AI will happily churn out 200 lines when a 10-line solution will do. I want people who can find the 10-line solution first.”
The Incorrect Requirement Test
“He throws in an incorrect requirement or impossible line in his coding challenges. He expects people to come back and say ‘hey, this is wrong, this is not possible, you made a mistake.’ If they don’t, he doesn’t hire them.”
Extreme Team Efficiency
“Our iOS team is small for how many people use it. It’s ridiculous. You won’t guess the number. It was three—and I was the third person.”
“A job for one person done by two will take twice as long. Especially now that you don’t need to physically write as much code. Everyone can be an architect. One brain can do a lot more.”
Part 8: Key Metrics and Numbers
| Metric | Value |
|---|---|
| Value per commit | $2.5 million |
| Colossus build time | 122 days |
| Engineering staff at Sully’s joining | ~100 people |
| iOS team size | 3 people |
| Management layers | 3 |
| Mobile generators | 80+ |
| Model speed target | 8x faster than human |
| Tesla cars in North America | ~4 million |
| Non-engineers at early xAI | <8 people |
| Speed improvement potential | 2-8x on most things |
Part 9: Core Mental Models
1. Work Backwards from the Constraint
“Very good at figuring out what the bottlenecks will be even years in the future and then trying to work backwards from that.”
2. Single Metric Focus
“Very quickly we come up with a metric that’s core to either the financial or the physical return. Everything is just focused on driving that metric.”
3. Delete First, Then Optimize
The SpaceX algorithm applied to software: aggressively remove things and add back only what’s proven necessary.
4. Timeline Compression Through Assumption Auditing
“The estimated time is based on assumptions. You look at the assumptions, ask: proportionally how much is this impacting my timeline? Knock it out or change it. Do that a few times, you can meet whatever requirement you want.”
5. One Brain is More Efficient
“A job for one person done by two will take twice as long. You just don’t need as many hands. One brain can do a lot more.”
6. Binary Responses Only
“The answer is either ’no that’s dumb’ or ‘why isn’t it done already?’”
Part 10: Personal Background - Sulaiman Ghori
Early Entrepreneurship
- Learned programming at 11; started making money by 12 selling game hacks
- Built a RepRap 3D printer at 13 (copper winding went 2 inches into his thumb during assembly)
- Ran a fidget spinner manufacturing and distribution business from his bedroom (shut down by the county after 2 months)
- Built a liquid fuel rocket engine in 4 weeks, fired it standing 6 feet away (jacket caught fire)
Philosophy: “Healthy Disrespect for Authority”
“I’ve always known from very young—I want an unconventional outcome. Going through a conventional path would pretty much necessarily not lead you to an unconventional outcome. So I grew opposed to any form of convention, and institutions necessarily enforce convention.”
Why xAI Over SpaceX/Tesla
“xAI is definitely the smallest company, the newest of all of them. My assumption is—and this is largely proven true—where you can have the most leverage and change as an individual person, because proportionately you’re a much larger percentage of the company.”
Conclusion: What Makes xAI Different
xAI’s speed comes from a combination of factors:
- Extremely flat hierarchy (3 layers) eliminating communication overhead
- Rejection of conventional software limitations through physics-first thinking
- 24-hour iteration cycles with immediate feedback integration
- Deliberate deletion of complexity and willingness to re-add only when proven necessary
- Working backwards from outcomes rather than forwards from constraints
- Massive parallel experimentation with low cost of failure
- High trust, low process environment where engineers own their domains completely
- Hiring for simplicity-seekers who challenge requirements rather than accept them
- Hardware infrastructure as moat - built in 122 days while competitors outsource to cloud
- Cross-company synergies - Tesla vehicles as distributed compute network
The 122-day Colossus build wasn’t just a one-time achievement—it established a culture and capability for rapid infrastructure deployment that continues to compound as a competitive advantage. Combined with the counter-intuitive small model strategy and the potential to deploy across millions of Tesla vehicles, xAI has positioned itself on a fundamentally different trajectory than other AI labs.
As Sulaiman puts it: “The levers are extremely strong.”