Grok 4.3

(docs.x.ai)

89 points | by simianwords 2 hours ago

20 comments

artdigital 1 hour ago
Grok is my favorite model for chatting, and my favorite voice mode. It seems to be the only voice mode that isn't routing to a extremely cheap model (like Haiku), and has been the highest quality out of all the frontier ones. When you subscribe to SuperGrok you can also create a "council" of agents, each with their own system prompt and when you ask something, they will all get asked in parallel to come to a conclusion. Good stuff!
Just wish they would finally put some work into their apps, it's the only thing keeping me from actually subscribing to SuperGrok:
- No MCP / connected apps support. It's been teased but here we are, still not available. I can't connect Grok to anything, so I can't use it for serious work
- Projects are still not available in the app so as soon as you move something into a project, it's gone from all the native apps
- No way to add artifacts (like generated markdown docs) directly to a project, we have to export to PDF/markdown and re-import. And there isn't even a way to export artifacts. This makes serious project work hard because we can't dynamically evolve projects with new information
- No memory, no ability to look up other chats, each chat is completely new
- No voice mode in projects at all
If someone from xAI is reading this, please consider adding some of these.
[-]
- artdigital 1 hour ago
  I also think Grok would benefit from allowing usage of "SuperGrok Heavy" (their $300 plan) in coding harnesses with included usage. Currently they give you some API credits on the Heavy plan so you can use some Grok for coding, but $300 USD value is just not there.
  Not saying they should create their own grok-code harness, just allowing usage in existing ones would already be beneficial. But that's probably what the Cursor acquisition is going to do eventually
- afpx 1 hour ago
  When I signed up, I accidently paid for a full year. So from time to time, I'll throw it something just to see what it produces compared to the other LLMs. And, even after all this time, it still feels like a really "dumb" model compared to the other frontier ones. But, worse, many of my system prompts make it go wacky and puke jibberish. However it was pretty cool for those couple months awhile back when it was uncensored. You could ask it about a wild conspiracy, and it would actually build the case and link you to legitimite source material. They dropped the hammer down on that real quick.
  [-]
  - 2ndorderthought 20 minutes ago
    Ah yes the psychosis reinforcement vertical. It's such a lucrative market for those schizophrenics and bipolars. Great way to get lots of engagement. Groks portfolio is so diverse
    [-]
    - readthenotes1 13 minutes ago
      I have a schizophrenic relative who is in such a relationship with grok. Instead of telling hen you need to take your meds, it says hen is the smartest person in the world
      [-]
      - 2ndorderthought 11 minutes ago
        I'm so sorry your family is suffering from this. I hope you can find a way to bring them back. Disorders featuring psychosis are so painful for everyone around them. Blessings to you and your family
- walletdrainer 57 minutes ago
  > No MCP / connected apps support. It's been teased but here we are, still not available. I can't connect Grok to anything, so I can't use it for serious work
  Grok has tool use, no? Why would you also need MCP? What does MCP add?
  [-]
  - artdigital 32 minutes ago
    I'm talking about the consumer Grok app and grok.com website. There currently are not connected apps (or MCP) at all, so while Grok can use tools, there is no way to add tools to it
sundarurfriend 1 hour ago
As an English-as-second-language speaker and writer, one thing Grok really shines at is capturing the tone and level of "formality" of a piece of text and the replicating it correctly. It seems to understand the little human subtleties of language in a way the other major providers don't. Chatgpt goes overly stiff and formal sounding, or ends up in a weird "aye guvnor" type informal language (Claude is sometimes better but not always).
Grok seems in general better at being "human" in ways that are hard to define: for eg. if I ask it "does this message roughly convey things correctly, to the level it can given this length", it will likely answer like a human would (either a yes or a change suggestion that sticks to the tone and length), while Chatgpt would write a dissertation on the message that still doesn't clear anything up.
Recently I've noticed that Grok seems to have gotten really good at dictation too (that feature where you click the mic to ask it something). Chatgpt has like 90-95% accuracy with my accent, the speech input on Android's Gboard something like 75%, Grok surprisingly gets something like 98% of my words correct.
[-]
- djyde 1 hour ago
  I've also noticed that when I communicate with Grok in my native language, its tone is more natural than other models. I think this is due to the advantage of being trained on a large amount of Twitter data. However, as Twitter contains more and more AI-generated content now, I'm afraid continued training will make it less natural.
  [-]
  - pacific01 32 minutes ago
    Did you try meta? I was into grok but now meta works well for me
  - thunderbong 56 minutes ago
    I'm sure Twitter knows which are the bot accounts and is surely excluding them from their model training. Twitter bots aren't a new phenomenon after all.
    [-]
    - pixel_popping 24 minutes ago
      There is bots everywhere, it has nothing to do with the platform, it has to do with attackers having an incentive to do mass account farming, no platform is secure against it.
tornikeo 1 hour ago
So, we have: - claude for corps and gov - codex for devs - grok for what, roleplay, racism? Those are the two things I've ever heard grok associated with around me.
[-]
- sudb 48 minutes ago
  So interestingly, I know of at least one application in a charity that deals with trafficking where grok was happy to do one-shot classification tasks where all other models refused to cooperate.
  I think there's a surprising number of actually useful applications in this sort of grey area for a slightly-less guardrailed, near-frontier model (also the grok-fast models are cheap!).
  [-]
  - 2ndorderthought 30 minutes ago
    There are lots of uncensored models out there. I don't think grok is leading in that front. They kind of pick and choose which things they want to support based on elons world views. Elon used to hang out with sex traffickers so of course grok is fine talking about it. Probably even offers strategies for them does free accounting has money laundering strategies etc...
    [-]
    - spiderfarmer 20 minutes ago
      For the naysayers: https://www.theguardian.com/technology/2026/jan/30/elon-musk...
      [-]
      - user34283 2 minutes ago
        We have been over the politically motivated slander many times; it's boring.
        The user above you could have explained what uncensored models he believes are more capable than Grok. Maybe the Chinese open-weights models are superior to Grok at the moment.
- coreyh14444 21 minutes ago
  If you need to ask about what people on Twitter are talking about, Grok is really good for that obviously. I use it all the time for "what are the cool kids on twitter saying is the best tiling window manager these days" or whatever. Also, if you have a question that's borderline shady, Grok will often deliver. "Can you find a grey market Windows license site for me" etc.
- ndr 1 hour ago
  You should try all of them, then update your opinion about your information sources accordingly.
- nsowz 1 hour ago
  Grok is as progressive as any of the other models. Despite some of the highly-publicised fuck-ups, try asking Grok anything racist and see how it replies. Yes, I know you didn't try this and you won’t.
  [-]
  - aqme28 59 minutes ago
    There is a lot of daylight in between “progressive” and “openly explicitly racist”
    [-]
    - 2ndorderthought 39 minutes ago
      Isn't grok currently holding the world record for the biggest generator of CSAM? Or did they change focus to enhance their racism and propaganda vertical? Things move so quickly these days hard to keep up!
      [-]
      - addedGone 12 minutes ago
        Mistral will also tell you how to do ransoms btw from A to Z in automated ways, you are saying they are responsible? I don't get the mix here.
        [-]
        2ndorderthought 8 minutes ago
        Yes any company generating csam should not be in business as a legitimate entity. Can you send me a link from a reputable enough source where Mistral models have done this? I didn't even realize they were doing image generation.
    - simianwords 36 minutes ago
      Can you share a prompt that can show how it is openly racist now? Lots of easy claims like this can be debunked
    - nsowz 56 minutes ago
      I didn’t say “progressive”; I said “as progressive”.
      [-]
      - aqme28 12 minutes ago
        I don't see how that changes my point at all.
        edit: to clarify for you, here's an example.
        Model A advocates for single-payer healthcare, while Model B prefers for the current US healthcare system. So on that one axis, A is more progressive than B. Neither of them needs to be racist for that calculation.
  - SanjayMehta 59 minutes ago
    100% agree. Grok may or may not be biased one way or the other as far as the US is concerned but from the rest of the world perspective it's mostly the same as any other model trained on Wikipedia.
- khalic 38 minutes ago
  Lol. I think they unleashed it on this post, look at the number of only vaguely related, lukewarm opinions trying to push the racism and CSAM stuff to the bottom
  [-]
  - johnnyApplePRNG 15 minutes ago
    [flagged]
- vrganj 1 hour ago
  Grok for furthering the far-right filter bubble Elon has been hard at work building.
  [-]
  - khalic 37 minutes ago
    And of course child porn
    [-]
    - pixel_popping 36 minutes ago
      [flagged]
      [-]
      - 2ndorderthought 29 minutes ago
        That's what it was doing. Like literally. Chatgpt it or Google it. Supporting grok is paying money to a csam generator.
        Edit I cannot reply to the post below me. I have gone entirely over to local models so I am paying zero dollars to any of the us defense contractors that are also tech companies. It's awesome.
        [-]
        pixel_popping 28 minutes ago
        [flagged]
        pixel_popping 21 minutes ago
        [flagged]
      - khalic 31 minutes ago
        Grok was used to create CSAM
        [-]
        pixel_popping 30 minutes ago
        [flagged]
        [-]
        spiderfarmer 18 minutes ago
        Musk partied with Epstein.
        [-]
        pixel_popping 16 minutes ago
        What's the correlation with Grok, the AI model?
      - spiderfarmer 19 minutes ago
        https://www.theguardian.com/technology/2026/jan/30/elon-musk...
        [-]
        pixel_popping 18 minutes ago
        What's the correlation with Musk and Grok (an AI model, kinda crazy how far fetch this is) producing "CP data" (which any model can do)?
  - simianwords 35 minutes ago
    How does Grok further far-right filter? This is blatantly untrue. Try prompting it and getting it to say something far right.
    Grok if anything reduces populism because fake claims can be debunked
    [-]
    - vrganj 33 minutes ago
      How could MechaHitler possibly be far right...
      [-]
      - 2ndorderthought 27 minutes ago
        When you really think about it palantir told me Hitler was good and therefore mechahitler aka grok should be a okay!
      - simianwords 31 minutes ago
        [flagged]
        [-]
        vrganj 29 minutes ago
        Sure. And so was the Holocaust denial and the misinformation about the French 2015 attacks and so on and so on.
        Its just roleplaying being a far right propaganda tool.
        [-]
        simianwords 26 minutes ago
        Source?
        [-]
        vrganj 21 minutes ago
        https://www.pbs.org/newshour/world/france-will-investigate-m...
        [-]
        simianwords 18 minutes ago
        As admitted they have fixed it. It’s obvious that a tool used so vastly might have problems like this. Surely if you think it is used to produce far right propaganda now you can reproduce? Or you choose to hinge on one off issues they fixed?
maz1b 1 hour ago
I still wish they named it something else, but congratulations to the team on what seems to be a good release!
Pricing is also quite surprising, compared to comparable competitors. I guess they have tons of capacity or really want to bring over more people.
[-]
- readthenotes1 11 minutes ago
  You don't like science fiction references in general or Heinlein in particular?
  [-]
  - draxil 1 minute ago
    I don't like that word, which was previously a common part of my vocabulary, being forever ruined?
mythz 1 hour ago
Ok speed (202.7 tok/s) and value (1.25 -> 2.50) look great, with pretty decent intelligence.
[-]
- pzo 1 hour ago
  The problem with speed is that they usually are very fast for first few weeks and then suddenly much slower. They did such trick when they advertised Grok 4 fast ( dropped from 200 tps to 60tps)
  [-]
  - victorbjorklund 1 hour ago
    Wow. That is a big drop.
- catcowcostume 1 hour ago
  [flagged]
  [-]
  - kuboble 1 hour ago
    I don't remember the source of the quote.
    But debating whether the models are intelligent is slim to debating whether a car can walk.
    You can offload to the model a lot of work that until recently we thought requires intelligence. The more and better of those tasks the model can do, it's fair to call it intelligence*
    [-]
    - NitpickLawyer 37 minutes ago
      "The question of whether a computer can think is no more interesting than the question of whether a submarine can swim." - Edsger Dijkstra
  - MrDrDr 1 hour ago
    Please elaborate.
    [-]
    - IshKebab 22 minutes ago
      Some people have this strange idea that only "whatever humans do" counts as intelligence, despite the fact that a) we don't really have a clue what humans do, and b) "intelligence" is definitely not that strictly defined.
      I think they're just trying to feel like they know some important truth that other people don't.
    - nesk_ 1 hour ago
      Prediction is not intelligence.
      [-]
      - mirekrusin 58 minutes ago
        Misprediction is?
  - exe34 1 hour ago
    What does intelligence mean to you?
netdur 1 hour ago
In court vs openai, Musk said Grok is partly trained on openai models, so it should be somehow similar to Chinese models in terms of performance and cost!
alyxya 1 hour ago
Despite their attrition, this combined with their cursor partnership is likely going to make them competitive in coding agents soon.
mirekrusin 49 minutes ago
All those plans from providers should be sliders – prepay more, get more in return.
agunapal 38 minutes ago
Very competitive price for the speed and intelligence being offered!
OtherShrezzing 1 hour ago
The tok/s stat is interesting. Since the dominant constraint on inference speed is hardware, it suggests X purchased far more compute than was really needed to serve the demand for their models.
Expensive miscalculation.
[-]
- flir 39 minutes ago
  Didn't a bunch of hardware that was destined for Tesla get redirected to xAI? I'm sure I remember something like that.
simianwords 2 hours ago
https://artificialanalysis.ai/models/grok-4-3
[-]
- nextaccountic 1 hour ago
  This puts Sonnet 4.6 above Opus 4.6 in the coding index.. kinda hard to trust those numbers.
  (Also it puts Opus 4.7 universally above Opus 4.6, and I may be wrong but this doesn't seem to match the experience of most/many/some people. I think it's widely recognized that Anthropic is severely lacking compute and Opus 4.7 is a costs saving measure)
  [-]
  - manmal 1 hour ago
    Anthropic themselves have (had?) this thing where Opus is used for planning and Sonnet for coding.
- Alifatisk 1 hour ago
  Does numbers don't look exciting at all? I may have gotten spoiled by releases from Qwen, Kimi and Z.ai who keep closing the gap between closed weight SOTA models and open weight. From my experience, Grok is only useful for one thing, and that's looking up things for you and gathering a consensus on topics. That's it.
  Update, I noted that Grok 4.3 is in the "Most attractive quadrant", that's cool! It is also in the top 5 highest in "AA-Omniscience Index", good! Really good.
- progbits 1 hour ago
  What's with the charts and numbers?
  It says #1 for speed but then in the chart it's #2. Also says #10 for intelligence but then it's #7 in the chart.
- BoorishBears 1 hour ago
  What an exciting game we're playing, where the most popular leaderboard is completely made up and the stakes are in the trillions.
khalic 1 hour ago
This project is a gigantic waste of resources, it’s fine tuned on politics of the CEO, was used for CSAM generation and just sucks overall
[-]
- johnnyApplePRNG 11 minutes ago
  The resource waste he's talking about is horrendous, read more here: https://time.com/7308925/elon-musk-memphis-ai-data-center/
- spiderfarmer 1 hour ago
  It’s a model made for 36% of Americans. The rest of the world can’t care less.
  [-]
  - 2ndorderthought 36 minutes ago
    Considering how few Americans there are and how little of that 39% even uses technology, that's what 20 million people at a maximum?
    [-]
    - Hugsun 22 minutes ago
      That seems like a decently sized market. Maybe not for an AI lab though.
      [-]
      - 2ndorderthought 17 minutes ago
        Sure it's a good market for a normal company. For a social media company it's pretty isolated and really limits the products that can come out. But their current verticals: propaganda, csam, and psychosis engagement are quite strong amongst that population.
        [-]
        cindyllm 13 minutes ago
        [dead]
- servo_sausage 54 minutes ago
  I like that there are models with divergent politics; the status quo being creepy corporate left silicon valley is not healthy or pleasant to interact with.
  Even with grock it's only broadening things to creepy corporate right of silicon valley.
BoredPositron 1 hour ago
Yay, free tokens. I don't know why but grok always seems good fast in the free token phase and after that degrades.
Imustaskforhelp 1 hour ago
Pelican riding a bike here: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056...
(ran this on arena.ai direct chat and also tried to write this gist inspired by how simon writes his gists about pelicans)
Edit: just realized that I made pelican riding a bike instead of bicycle, which now makes sense as to why it hardened the bicycle to look tankier, going to compare this with pelican riding a bicycle if anybody else shares the pelican riding a bicycle.
[-]
- gchamonlive 1 hour ago
  https://simonwillison.net/2025/Nov/13/training-for-pelicans-...
  You should probably come up with variations, like a beaver riding a scooter or something, just to see what's what :)
  [-]
  - Imustaskforhelp 1 hour ago
    Thanks I have generated both
    beaver riding a scooter: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056...
    pelican riding a bicycle: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056...
    Personal opinion but the beaver one looks especially bad as compared to pelicans. Can we be for sure that this model of grok-4.3 hasn't been trained on pelican. Simonw in blog-post says that he will try with other creatures so I hope he does that but it does feel to me as the model/xAI is trying to cheat, Hope Simonw tests it out more.
    Edit: Also added turtle riding a scooter, something which literally has images online or heck even teenage mutant ninja turtles and I thought that it would be able to pass this but it wasn't even able to generate this: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056...
    This literally looks more avocado than turtle. Perhaps this could be a bug from arena.ai or something else too, not sure but at this point waiting for simon's analysis.
    [-]
    - gchamonlive 54 minutes ago
      We can never be sure of course, but I think this is a very strong indication that pelican riding a bike is indeed going into the training dataset.
      Thanks for generating those!
happosai 1 hour ago
I lost the trust in them when they added the racist "what about killing of Boers in south Africa" thing to their system prompt.
No way am I going to use a model where the backing has such blatantly obvious brain washing goals.
[-]
- Hugsun 19 minutes ago
  It is unbelievable that this is a controversial opinion.
- miroljub 1 hour ago
  [flagged]
  [-]
  - vrganj 1 hour ago
    There is no non-bias. What you call unbiased is always just a reflection of your personal biases.
    That being said, I am definitely against a model that is biased to be following the ideology of a far-right extremist.
  - Jtarii 57 minutes ago
    Musk bought a social media company for the specific purpose of getting Trump elected by turning it into a right wing propaganda machine. Have Anthropic/OpenAI/Google done something similar to that?
  - henry2023 32 minutes ago
    [dead]
unit149 23 minutes ago
[dead]
shchess 50 minutes ago
[dead]
th3b0tk1ll3r 1 hour ago
[flagged]
[-]
- curtisblaine 1 hour ago
  Please avoid comments with no real substance, written just to denigrate, with a throwaway account. They make discourse unnecessarily worse.
alfiedotwtf 1 hour ago
If there was any model I wouldn’t trust, it wouldn’t be the ones from China, it would be the one from Elon Musk
[-]
- Cthulhu_ 1 hour ago
  Thankfully it's not an either / or, I don't trust any models. This is a healthy attitude to have because you shouldn't trust anyone on the internet either, especially when it comes to specific subjects.
  [-]
  - 2ndorderthought 33 minutes ago
    I don't trust this. But by not trusting it I am inherently trusting it. But by trusting it I shouldn't.
ragchronos 1 hour ago
When looking at the benchmarks, this model seems to be really close to Kimi K2.6 in terms of intelligence and pricing, hitting that sweet spot. It does also have a higher AA-Omniscience index, which is something kimi and other open models lack in. Curious to see how pleasant it is to use.
[-]
- alfiedotwtf 1 hour ago
  I’ll eat my hat if it even comes close to Kimi
  [-]
  - mirekrusin 56 minutes ago
    How would you like it? Well done?
    [-]
    - __patchbit__ 11 minutes ago
      What about spending $41 million on each model's tokens and seeing the value gain? be it efficiency gain in factory work or energy savings in austere battlescape hunting.