{"id":382,"date":"2023-03-15T19:06:39","date_gmt":"2023-03-15T19:06:39","guid":{"rendered":"https:\/\/procyonic.org\/blog\/?p=382"},"modified":"2023-03-16T15:18:27","modified_gmt":"2023-03-16T15:18:27","slug":"to-avoid-ai-hype-look-for-the-training-data-and-the-objective-function","status":"publish","type":"post","link":"https:\/\/procyonic.org\/blog\/to-avoid-ai-hype-look-for-the-training-data-and-the-objective-function\/","title":{"rendered":"To Avoid AI Hype look for the Training Data and the Objective Function"},"content":{"rendered":"\n<p>I&#8217;ve got to admit, ChatGPT is pretty amazing. And I&#8217;ve spent a few nights, as a technical professional and teacher, wondering how it might complicate my life. And I&#8217;m sure it will. But I think its easy to look at this text generating monster and get a little unhinged about the prospects for AI in the near future.<\/p>\n\n\n\n<p>One thing these models have convinced me of is that <em>if<\/em> there is training data for a task <em>then<\/em> I expect contemporary AI methods to eventually automate it. The internet is a giant pile of text and what we&#8217;ve seen is that a surprisingly small model, just billions of parameters, is sufficient to generate coherent text conditional on some previous text. While its <em>also<\/em> surprising how many tasks come down to just generating coherent text given a prompt, its also clear that the ability to do so doesn&#8217;t constitute general intelligence <em>except<\/em> of the kind that is represented specifically by generating plausible text.<\/p>\n\n\n\n<p>Four things have enabled these large language models: advances in computer technology, advances is the architecture of the neural networks that underlie them, and, most critically, the availability of a large training data set and a clear objective function. AI applications have succeeded wildly when these last two conditions have been met and continue to struggle where they cannot be. If you&#8217;re wondering what area might be disrupted next, look for the places where the objective function and data are available. Despite the ability to write code, for example, there isn&#8217;t a lot of training data out there on how to debug subtle problems coupled with a given company&#8217;s problem domain, for example. There are many areas of human endeavor where the material to train a machine to do the work simply isn&#8217;t available in any accessible form and where devising an objective function which covers a large number of cases is difficult.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><a href=\"https:\/\/procyonic.org\/blog\/wp-content\/uploads\/2023\/03\/00000-3256100801.png\"><img loading=\"lazy\" decoding=\"async\" width=\"928\" height=\"512\" src=\"https:\/\/procyonic.org\/blog\/wp-content\/uploads\/2023\/03\/00000-3256100801.png\" alt=\"\" class=\"wp-image-385\" srcset=\"https:\/\/procyonic.org\/blog\/wp-content\/uploads\/2023\/03\/00000-3256100801.png 928w, https:\/\/procyonic.org\/blog\/wp-content\/uploads\/2023\/03\/00000-3256100801-300x166.png 300w, https:\/\/procyonic.org\/blog\/wp-content\/uploads\/2023\/03\/00000-3256100801-768x424.png 768w\" sizes=\"auto, (max-width: 928px) 100vw, 928px\" \/><\/a><figcaption>prompt: a laughing robot shoveling an enormous pile of paper into its gaping mouth like a salad with a fork &#8220;soviet art&#8221; abstract &#8220;line drawing&#8221;<\/figcaption><\/figure>\n\n\n\n<p>In the long term I don&#8217;t have any illusions. Human beings have general intelligence and are physically realized beings who manage to use that intelligence without any easily described global objective function. I&#8217;m sure one day we&#8217;ll figure this trick out or just steal it from nature. But large language models are, in a sense, just more of the same: statistical models that work because of a well specified objective function and voluminous training data. I don&#8217;t expect those requirements to change for the foreseeable future.<\/p>\n\n\n\n<p>Of course, the length of the foreseeable future gets shorter all the time.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I&#8217;ve got to admit, ChatGPT is pretty amazing. And I&#8217;ve spent a few nights, as a technical professional and teacher, wondering how it might complicate my life. And I&#8217;m sure it will. But I think its [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_mi_skip_tracking":false,"footnotes":""},"categories":[1],"tags":[],"class_list":["post-382","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/procyonic.org\/blog\/wp-json\/wp\/v2\/posts\/382","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/procyonic.org\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/procyonic.org\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/procyonic.org\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/procyonic.org\/blog\/wp-json\/wp\/v2\/comments?post=382"}],"version-history":[{"count":3,"href":"https:\/\/procyonic.org\/blog\/wp-json\/wp\/v2\/posts\/382\/revisions"}],"predecessor-version":[{"id":388,"href":"https:\/\/procyonic.org\/blog\/wp-json\/wp\/v2\/posts\/382\/revisions\/388"}],"wp:attachment":[{"href":"https:\/\/procyonic.org\/blog\/wp-json\/wp\/v2\/media?parent=382"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/procyonic.org\/blog\/wp-json\/wp\/v2\/categories?post=382"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/procyonic.org\/blog\/wp-json\/wp\/v2\/tags?post=382"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}