{"id":188,"date":"2024-07-05T15:25:28","date_gmt":"2024-07-05T10:25:28","guid":{"rendered":"https:\/\/augurytech.co.uk\/courses\/?p=188"},"modified":"2024-10-10T09:56:42","modified_gmt":"2024-10-10T04:56:42","slug":"acquiring-data","status":"publish","type":"post","link":"https:\/\/augurytech.co.uk\/courses\/artificial-intelligence\/acquiring-data\/","title":{"rendered":"Acquiring Data"},"content":{"rendered":"\n<p>We need data to create a machine-learning model.<\/p>\n\n\n\n<p><strong>Acquiring Data or Data Acquisition<\/strong> is the first and one of the most crucial steps in any data analysis or machine learning project. It involves gathering and measuring information from various sources to draw meaningful insights. We use this data for analysis, modelling, and decision-making. The quality and relevance of the acquired data directly impact the analysis outcomes.<\/p>\n\n\n\n<h4 class=\"wp-block-heading has-accent-color has-text-color has-link-color wp-elements-bc7f06c9128fe4f5f307033d9b7fb1e9\">Importance of Data Acquisition<\/h4>\n\n\n\n<p><strong>Good data acquisition is essential<\/strong> for reliable insights and predictions because it lays the foundation for data analysis and machine learning. Poor data acquisition can lead to incorrect conclusions and flawed decision-making.<\/p>\n\n\n\n<div style=\"height:50px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<div class=\"wp-block-group is-vertical is-content-justification-stretch is-layout-flex wp-container-core-group-is-layout-353c4f5a wp-block-group-is-layout-flex\">\n<h3 class=\"wp-block-heading\">Methods of Data Acquisition<\/h3>\n\n\n\n<p><strong>Manual labelling:<\/strong> A label is the expected output attached to each data example so that the machine can learn from it. 
In manual labelling, the data is collected first and people then assign the labels by hand.<\/p>\n\n\n\n<div class=\"wp-block-group is-vertical is-layout-flex wp-container-core-group-is-layout-fe9cc265 wp-block-group-is-layout-flex\">\n<p><strong>From observing human behaviours<\/strong>: People generate data through their everyday activities. When we visit a website, our actions are recorded in a log. On an e-commerce website, the log can include our searches, the time of our visit, whether we made a purchase, the purchase amount, the items purchased, and where the purchase was made.<\/p>\n\n\n\n<figure class=\"wp-block-table is-style-stripes\"><table class=\"has-border-color has-base-border-color has-fixed-layout\" style=\"border-width:1px\"><thead><tr><th class=\"has-text-align-center\" data-align=\"center\">User ID<\/th><th class=\"has-text-align-center\" data-align=\"center\">Time<\/th><th class=\"has-text-align-center\" data-align=\"center\">Price ($)<\/th><th class=\"has-text-align-center\" data-align=\"center\">Purchased<\/th><\/tr><\/thead><tbody><tr><td class=\"has-text-align-center\" data-align=\"center\">4783<\/td><td class=\"has-text-align-center\" data-align=\"center\">21 Jan 8:15:20<\/td><td class=\"has-text-align-center\" data-align=\"center\">7.95<\/td><td class=\"has-text-align-center\" data-align=\"center\">yes<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">3893<\/td><td class=\"has-text-align-center\" data-align=\"center\">3 Mar 11:30:15<\/td><td class=\"has-text-align-center\" data-align=\"center\">10<\/td><td class=\"has-text-align-center\" data-align=\"center\">yes<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">7453<\/td><td class=\"has-text-align-center\" data-align=\"center\">11 Jun 14:15:5<\/td><td class=\"has-text-align-center\" data-align=\"center\">9.5<\/td><td class=\"has-text-align-center\" data-align=\"center\">no<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">931<\/td><td 
class=\"has-text-align-center\" data-align=\"center\">2 Aug 20:30:55<\/td><td class=\"has-text-align-center\" data-align=\"center\">12.9<\/td><td class=\"has-text-align-center\" data-align=\"center\">yes<\/td><\/tr><\/tbody><\/table><\/figure>\n<\/div>\n\n\n\n<div class=\"wp-block-group is-vertical is-layout-flex wp-container-core-group-is-layout-fe9cc265 wp-block-group-is-layout-flex\">\n<p><strong>From observing behaviours of machines<\/strong>: Machines also generate data. Logs can be created for any industrial machine to monitor its performance across various variables.<\/p>\n\n\n\n<p><strong>Download from websites \/ partnerships<\/strong>: Thanks to the open internet, many datasets are available for free online, for example:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Computer vision or image datasets.<\/li>\n\n\n\n<li>Self-driving car datasets.<\/li>\n\n\n\n<li>Speech recognition datasets.<\/li>\n\n\n\n<li>Medical imaging datasets.<\/li>\n<\/ul>\n<\/div>\n\n\n\n<div style=\"height:0px\" aria-hidden=\"true\" class=\"wp-block-spacer wp-container-content-b30579f5\"><\/div>\n\n\n\n<p><strong>Surveys and Questionnaires<\/strong> are common methods for collecting data directly from individuals. They can be conducted online, via phone, or in person. This method is useful for gathering specific information from a targeted audience.<\/p>\n\n\n\n<p><strong>Web Scraping<\/strong> involves extracting data from websites. This method is useful for collecting large amounts of data from online sources. However, it\u2019s necessary to ensure that <em>web scraping complies with legal and ethical guidelines<\/em>.<\/p>\n\n\n\n<p><strong>APIs (Application Programming Interfaces)<\/strong> allow you to access data from various online services and databases. 
Many organizations provide APIs to share their data with developers and researchers.<\/p>\n\n\n\n<p><strong>Databases<\/strong> are structured data collections that can be easily accessed, managed, and updated. Data can be acquired from internal databases within an organization or from external databases available for public use.<\/p>\n\n\n\n<p><strong>Sensors and Internet of Things (IoT) devices <\/strong>collect data from the physical world. This method is commonly used in industries like manufacturing, healthcare, and smart cities.<\/p>\n<\/div>\n\n\n\n<div style=\"height:50px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<div class=\"wp-block-group is-vertical is-layout-flex wp-container-core-group-is-layout-fe9cc265 wp-block-group-is-layout-flex\">\n<h3 class=\"wp-block-heading\">Challenges in Acquiring Data<\/h3>\n\n\n\n<ol class=\"wp-block-list\">\n<li><strong>Data Quality: <\/strong>Ensuring the accuracy, completeness, and consistency of data is a primary challenge. Poor-quality data can lead to incorrect analysis and decisions.<\/li>\n\n\n\n<li><strong>Data Privacy and Security:<\/strong> Collecting data sometimes involves handling sensitive information. It\u2019s crucial to comply with data privacy laws and ensure data is securely stored and processed.<\/li>\n\n\n\n<li><strong>Data Integration:<\/strong> Combining data from different sources can be challenging due to differences in formats, structures, and quality. Effective data integration is necessary for comprehensive analysis.<\/li>\n<\/ol>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>We need data to create a machine-learning model. Acquiring Data or Data Acquisition is the first and one of the most crucial steps in any data analysis or machine learning project. It involves gathering and measuring information from various sources to draw meaningful insights. We use this data for analysis, modelling, and decision-making. 
The [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"artificial-intelligence","format":"standard","meta":{"_seopress_robots_primary_cat":"none","_seopress_titles_title":"","_seopress_titles_desc":"","_seopress_robots_index":"","footnotes":""},"categories":[2,4],"tags":[],"class_list":{"0":"post-188","1":"post","2":"type-post","3":"status-publish","4":"format-standard","6":"category-artificial-intelligence","7":"category-introduction-to-artificial-intelligence"},"_links":{"self":[{"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/posts\/188","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/comments?post=188"}],"version-history":[{"count":28,"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/posts\/188\/revisions"}],"predecessor-version":[{"id":1143,"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/posts\/188\/revisions\/1143"}],"wp:attachment":[{"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/media?parent=188"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/categories?post=188"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/augurytech.co.uk\/courses\/wp-json\/wp\/v2\/tags?post=188"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}