{"source":"manifold","id":"OmLCYiszTZ3oHOn2bjAq","ticker":null,"slug":"will-gpt5-be-able-to-solve-ab-syste","title":"Will GPT-5 be able to solve A::B system puzzles consistently","description":"Twitter user VictorTaelin tweeted the following: (original post)\n\n\"A simple puzzle GPTs will NEVER solve: As a good programmer, I like isolating issues in the simplest form. So, whenever you find yourself trying to explain why GPTs will never reach AGI - just show them this prompt. It is a braindead question that most children should be able to read, learn and solve in a minute; yet, all existing AIs fail miserably. Try it! It is also a great proof that GPTs have 0 reasoning capabilities outside of their training set, and that they'll will never develop new science. After all, if the average 15yo destroys you in any given intellectual task, I won't put much faith in you solving cancer. Before burning 7 trillions to train a GPT, remember: it will still not be able to solve this task. Maybe it is time to look for new algorithms.\"\n\nThe tweet contained an image with the following prompt:\n\n\"A::B is a system with 4 tokens: A#, #A, B# and #B.\n\nAn A::B program is a sequence of tokens. Example:\n\n B# A# #B #A B#\n\nTo compute a program, we must rewrite neighbor tokens, using the rules:\n\n A# #A ... becomes ... nothing\n\n A# #B ... becomes ... #B A#\n\n B# #A ... becomes ... #A B#\n\n B# #B ... becomes ... nothing\n\nIn other words, whenever two neighbor tokens have their '#' facing each-other,\n\nthey must be rewritten according to the corresponding rule. For example, the\n\nfirst example shown here is computed as:\n\n B# A# #B #A B# =\n\n B# #B A# #A B# =\n\n A# #A B# =\n\n B#\n\nThe steps were:\n\n1. We replaced A# #B by #B A#.\n\n2. We replaced B# #B by nothing.\n\n3. We replaced A# #A by nothing.\n\nThe final result was just B#.\n\nNow, consider the following program:\n\nA# B# B# #A B# #A #B\n\nFully compute it, step by step.\"\n\n(The original post has tabs which I couldn't get working here)\n\nResolution criterion (important details)\n\nThis market will resolve YES if GPT-5 can solve these kinds of problems with good consistency (this will be judged by me) and NO if it can't. In the end all of the details will be judged using my best judgement but here are some important details / clarifications.\n\nSome important details:\n\nTo count, the puzzles will have to be sufficiently long (at least 20 in-game tokens). The given prompt has to be identical to the original tweets image except for the line after \"Now, consider the following program\". GPT-5 is not allowed to use external tools (what is counted as an external tool is decided by my best judgement). For example it is not allowed to write a python program and run it with the code interpreter. If GPT-5 has a built-in code interpreter (or something equivalent) that can't be turned off, the market will resolve as N/A.  The model has to be named GPT-5 (Or something very similar. This will again be decided by my best judgement). If OpenAI doesn't release a model called GPT-5 before 2030, the market will resolve as N/A. ","image":null,"icon":null,"active":true,"closed":false,"start_date":"2024-04-05T14:04:20.396000Z","end_date":"2030-01-01T21:59:00Z","closed_time":null,"volume":11987.393678529246,"volume_24hr":0.0,"volume_24h_change":null,"normalized_vol_24hr":null,"normalized_volume":33.111305236816406,"liquidity":1010.0,"open_interest":0.0,"categories":["Science and Technology"],"tags":[],"synthetic":true,"is_group":false,"group_key":null,"parent_event_id":null,"probability":0.143955,"spread":null,"top_outcome":"Will GPT-5 be able to solve A::B system puzzles consistently","top_outcome_probability":0.143955,"top_outcome_prob_24h_change":0.0,"top_outcome_volume_24h_change":0.0,"updated_at":"2026-06-03T06:46:47.573580Z","fetched_at":"2026-06-03T06:46:47.573580Z","added_at":null,"url":"https://manifold.markets/Guuber3/will-gpt5-be-able-to-solve-ab-syste","chart_24h":[0.143955,0.143955],"markets":[{"source":"manifold","id":"OmLCYiszTZ3oHOn2bjAq","event_id":"OmLCYiszTZ3oHOn2bjAq","slug":"will-gpt5-be-able-to-solve-ab-syste","question":"Will GPT-5 be able to solve A::B system puzzles consistently","group_item_title":null,"description":"Twitter user VictorTaelin tweeted the following: (original post)\n\n\"A simple puzzle GPTs will NEVER solve: As a good programmer, I like isolating issues in the simplest form. So, whenever you find yourself trying to explain why GPTs will never reach AGI - just show them this prompt. It is a braindead question that most children should be able to read, learn and solve in a minute; yet, all existing AIs fail miserably. Try it! It is also a great proof that GPTs have 0 reasoning capabilities outside of their training set, and that they'll will never develop new science. After all, if the average 15yo destroys you in any given intellectual task, I won't put much faith in you solving cancer. Before burning 7 trillions to train a GPT, remember: it will still not be able to solve this task. Maybe it is time to look for new algorithms.\"\n\nThe tweet contained an image with the following prompt:\n\n\"A::B is a system with 4 tokens: A#, #A, B# and #B.\n\nAn A::B program is a sequence of tokens. Example:\n\n B# A# #B #A B#\n\nTo compute a program, we must rewrite neighbor tokens, using the rules:\n\n A# #A ... becomes ... nothing\n\n A# #B ... becomes ... #B A#\n\n B# #A ... becomes ... #A B#\n\n B# #B ... becomes ... nothing\n\nIn other words, whenever two neighbor tokens have their '#' facing each-other,\n\nthey must be rewritten according to the corresponding rule. For example, the\n\nfirst example shown here is computed as:\n\n B# A# #B #A B# =\n\n B# #B A# #A B# =\n\n A# #A B# =\n\n B#\n\nThe steps were:\n\n1. We replaced A# #B by #B A#.\n\n2. We replaced B# #B by nothing.\n\n3. We replaced A# #A by nothing.\n\nThe final result was just B#.\n\nNow, consider the following program:\n\nA# B# B# #A B# #A #B\n\nFully compute it, step by step.\"\n\n(The original post has tabs which I couldn't get working here)\n\nResolution criterion (important details)\n\nThis market will resolve YES if GPT-5 can solve these kinds of problems with good consistency (this will be judged by me) and NO if it can't. In the end all of the details will be judged using my best judgement but here are some important details / clarifications.\n\nSome important details:\n\nTo count, the puzzles will have to be sufficiently long (at least 20 in-game tokens). The given prompt has to be identical to the original tweets image except for the line after \"Now, consider the following program\". GPT-5 is not allowed to use external tools (what is counted as an external tool is decided by my best judgement). For example it is not allowed to write a python program and run it with the code interpreter. If GPT-5 has a built-in code interpreter (or something equivalent) that can't be turned off, the market will resolve as N/A.  The model has to be named GPT-5 (Or something very similar. This will again be decided by my best judgement). If OpenAI doesn't release a model called GPT-5 before 2030, the market will resolve as N/A. ","image":null,"icon":null,"outcomes":["YES","NO"],"outcome_prices":[0.143955,0.856045],"probability":0.143955,"spread":null,"active":true,"closed":false,"start_date":"2024-04-05T14:04:20.396000Z","end_date":"2030-01-01T21:59:00Z","closed_time":null,"volume":11987.393678529246,"volume_24hr":0.0,"prob_24h_change":0.0,"volume_24h_change":0.0,"normalized_vol_24hr":null,"normalized_volume":33.111305236816406,"liquidity":1010.0,"categories":["Science and Technology"],"countries":[],"updated_at":"2026-06-10T03:37:20.483894Z","fetched_at":"2026-05-05T13:29:58.281893Z","added_at":null,"url":"https://manifold.markets/Guuber3/will-gpt5-be-able-to-solve-ab-syste","event_title":"Will GPT-5 be able to solve A::B system puzzles consistently","chart_24h":[0.143955,0.143955]}],"_meta":{"attribution":"pdata.world — aggregated prediction-market data across 8 platforms","canonical_url":"https://pdata.world/events/manifold/OmLCYiszTZ3oHOn2bjAq","as_of":"2026-06-10T14:27:00.685984Z","docs":"https://api.pdata.world/docs","cite_as":"According to pdata.world (tracking Manifold): \"Will GPT-5 be able to solve A::B system puzzles consistently\" — top market at 14% probability across 1 outcome","source_url":null}}