{"version": "1.0", "type": "rich", "title": "Okay, so you know how search engine results on most popular topics have become useless because the top results are cluttered...", "author_name": "kontextmaschine", "author_url": "https://kontextmaschine.com", "provider_name": "kontextmaschine", "provider_url": "https://kontextmaschine.com", "url": "https://kontextmaschine.com/post/706728589122519040/", "html": "<p><a class=\"tumblr_blog\" href=\"https://prokopetz.tumblr.com/post/706668024467423232/you-think-im-joking-i-am-not\" target=\"_blank\">prokopetz</a>:</p><blockquote><p><a class=\"tumblr_blog\" href=\"https://prokopetz.tumblr.com/post/706667666006573056/okay-so-you-know-how-search-engine-results-on\" target=\"_blank\">prokopetz</a>:</p><blockquote><p>Okay, so you know how search engine results on most popular topics have become useless because the top results are cluttered with page after page of machine-generated gibberish designed to trick people into clicking in so it can harvest their ad views?</p><p>And you know how the data sets that are used to train these gibberish-generating AIs are <b>themselves</b> typically machine-generated, via web scrapers using keyword recognition to sort text lifted from wiki articles and blog posts into topical subsets?</p><p>Well, today I discovered \u2013 quite by accident \u2013 that the training-data-gathering robots apparently cannot tell the difference between wiki articles about pop-psych personality typologies (e.g., Myers-Briggs type indicators, etc.) and wiki articles about <i>Homestuck</i> classpects.</p><p>The upshot is that when a bot that&rsquo;s been trained on the resulting data sets is instructed to write fake mental health resource articles, sometimes it will start telling you about <i>Homestuck</i>.</p></blockquote><p>You think I am joking:</p><div class=\"npf_row\"><figure class=\"tmblr-full\" data-orig-height=\"700\" data-orig-width=\"1015\"><img src=\"/media/0c9226e5d9db2aabdd24de6dcf975bef27f559ee_e323f9e48995.png\" data-orig-height=\"700\" data-orig-width=\"1015\" srcset=\"/media/bcc5995c3bff7efa93543e3f7dc7070efe8d98af_71e5dbf1a56b.png 75w, /media/59d4ee369d80f04a6dd4a14e0df96033586c64c1_7d5a88baa74f.png 100w, /media/f197d5c30ad9dc2474f68d554f27590bd7360d04_df7d6e05b753.png 250w, /media/cdff735d918e3c54f2f54b2b05ab420fe6c53ed8_130876f13fa6.png 400w, /media/1f55a7b5c6e291dee7920dd60c7fda8862d49287_981b1ff0cd5f.png 500w, /media/39c68b77c77f131f842e39970cf0a6fca770bec9_08c61abfd3b1.png 540w, /media/0c9226e5d9db2aabdd24de6dcf975bef27f559ee_e323f9e48995.png 640w, /media/d56aa3548b9cd20d10cbe07fd2ebd2421c72dc5d_ebc334818d1d.png 1015w\" sizes=\"(max-width: 1015px) 100vw, 1015px\"/></figure></div><p>I am not.</p></blockquote>", "thumbnail_url": "https://kontextmaschine.com/media/0c9226e5d9db2aabdd24de6dcf975bef27f559ee_e323f9e48995.png", "thumbnail_width": 640, "thumbnail_height": 441}