{"id":218,"date":"2024-07-08T10:05:10","date_gmt":"2024-07-08T05:05:10","guid":{"rendered":"https:\/\/augurytech.co.uk\/courses\/?p=218"},"modified":"2024-10-10T13:00:02","modified_gmt":"2024-10-10T08:00:02","slug":"use-and-misuse-of-data","status":"publish","type":"post","link":"https:\/\/augurytech.co.uk\/courses\/artificial-intelligence\/use-and-misuse-of-data\/","title":{"rendered":"Use and Misuse of Data"},"content":{"rendered":"\n<p>I need three years to build up my IT team because we collect a large amount of data. The data collection process alone can take months or even years. After three years, I will have a perfect dataset. However, when presenting the data to the AI team, they might find flaws, gaps, or something missing that could make all the hard work during those years and months useless.<\/p>\n\n\n\n<p>We&#8217;ll do AI then <strong>what&#8217;s wrong with this approach?<\/strong> <strong>It turns out that&#8217;s a really bad strategy.<\/strong><\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Why This Approach Is Wrong<\/h3>\n\n\n\n<p>Waiting to involve your AI team until after data collection is complete is not the best strategy. Instead, start showing your collected data to the AI team as soon as possible. It allows the AI team to provide feedback on the data types to collect and the necessary IT infrastructure to build.<\/p>\n\n\n\n<p><strong>Example Scenario:<\/strong> Maybe an AI team can look at your factory data and say, &#8220;Hey, You know what? If you can collect data from this manufacturing machine, Not just once every 10 minutes, but instead once every minute, then we could do a better job building a preventative maintenance system for you.&#8221;<\/p>\n\n\n\n<h4 class=\"wp-block-heading has-accent-color has-text-color has-link-color wp-elements-35e2fa154af94619a2de0a2bb31b4fbf\">Common Misconceptions<\/h4>\n\n\n\n<p><strong>Another misconception is that You have so much data. Surely, the AI team can make it valuable.<\/strong> You have it in the note, but we just discussed that the data has not been shared with the team yet. Only the team knows better whether the data you want to use for that project is accurate or not.<\/p>\n\n\n\n<h4 class=\"wp-block-heading has-accent-color has-text-color has-link-color wp-elements-745a18c5a521142f07a8143e1300c879\"><strong>The Importance of Data Quality<\/strong><\/h4>\n\n\n\n<p><strong>Data is valuable or invaluable<\/strong>, but mistakes can occur during data collection. Even when conducting surveys and accurately collecting data, errors can still happen during data entry into the database.<\/p>\n\n\n\n<p>If you have a large amount of data, but bad data, then AI will learn inaccurate things. This will be a problem because Data is now Messy.<\/p>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-d0b3c9c8 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\">\n<p><strong> Data problems<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Inaccurate labels<\/li>\n\n\n\n<li>Missing values.<\/li>\n<\/ul>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\">\n<p><strong>Multiple types of data<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Unstructured Data: Images, Audio, text.<\/li>\n<\/ul>\n<\/div>\n<\/div>\n\n\n\n<h3 class=\"wp-block-heading\">Conclusion<\/h3>\n\n\n\n<p style=\"font-size:clamp(1.125rem, 1.125rem + ((1vw - 0.2rem) * 0.612), 1.5rem);\">Involving your AI team early in the data collection can save time and resources. You can build a more robust and effective AI system by getting feedback on data types and collection methods. <\/p>\n\n\n\n<p style=\"font-size:clamp(1.125rem, 1.125rem + ((1vw - 0.2rem) * 0.612), 1.5rem);\">Remember, the quality of your data is just as important as the quantity.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I need three years to build up my IT team because we collect a large amount of data. The data collection process alone can take months or even years. After three years, I will have a perfect dataset. However, when presenting the data to the AI team, they might find flaws, gaps, or something missing [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"artificial-intelligence","format":"standard","meta":{"_seopress_robots_primary_cat":"none","_seopress_titles_title":"","_seopress_titles_desc":"","_seopress_robots_index":"","footnotes":""},"categories":[2,4],"tags":[],"class_list":{"0":"post-218","1":"post","2":"type-post","3":"status-publish","4":"format-standard","6":"category-artificial-intelligence","7":"category-introduction-to-artificial-intelligence"},"_links":{"self":[{"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/posts\/218","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/comments?post=218"}],"version-history":[{"count":20,"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/posts\/218\/revisions"}],"predecessor-version":[{"id":1167,"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/posts\/218\/revisions\/1167"}],"wp:attachment":[{"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/media?parent=218"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/categories?post=218"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/tags?post=218"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}