{"id":2379,"date":"2025-06-05T09:20:42","date_gmt":"2025-06-05T09:20:42","guid":{"rendered":"https:\/\/diznr.com\/?p=2379"},"modified":"2025-06-05T09:20:42","modified_gmt":"2025-06-05T09:20:42","slug":"data-science-what-is-data-mining-complete-understanding-with-visualisation-real","status":"publish","type":"post","link":"https:\/\/www.reilsolar.com\/pdf\/data-science-what-is-data-mining-complete-understanding-with-visualisation-real\/","title":{"rendered":"DATA SCIENCE: what is data mining ( Complete understanding ) with real visualisation"},"content":{"rendered":"<p>DATA SCIENCE: what is data mining ( Complete understanding ) with real visualisation<\/p>\n\n<div class=\"wp-block-fv-player-gutenberg-basic\">[fvplayer id=&#8221;16&#8243;]<\/div>\n<div>\n<h3 data-start=\"0\" data-end=\"73\"><strong data-start=\"2\" data-end=\"71\">\u00a0Complete Understanding of Data Mining with Real Visualizations<\/strong><\/h3>\n<h3 data-start=\"75\" data-end=\"107\"><strong data-start=\"78\" data-end=\"105\">\u00a0What is Data Mining?<\/strong><\/h3>\n<p data-start=\"108\" data-end=\"384\"><strong data-start=\"108\" data-end=\"123\">Data Mining<\/strong> is the process of <strong data-start=\"142\" data-end=\"194\">extracting useful patterns, trends, and insights<\/strong> from large datasets using <strong data-start=\"221\" data-end=\"283\">statistical, mathematical, and machine learning techniques<\/strong>. It is widely used in <strong data-start=\"306\" data-end=\"347\">business, healthcare, finance, and AI<\/strong> to make <strong data-start=\"356\" data-end=\"381\">data-driven decisions<\/strong>.<\/p>\n<p data-start=\"386\" data-end=\"590\"><strong data-start=\"389\" data-end=\"415\">Think of it like this:<\/strong> Imagine you are digging for gold in a huge mountain of rocks. <strong data-start=\"478\" data-end=\"493\">Data Mining<\/strong> is the process of filtering out unnecessary data (rocks) and finding valuable patterns (gold).<\/p>\n<h3 data-start=\"597\" data-end=\"633\"><strong data-start=\"600\" data-end=\"631\">\u00a0Key Steps in Data Mining<\/strong><\/h3>\n<h3 data-start=\"634\" data-end=\"663\"><strong data-start=\"638\" data-end=\"661\">\u00a0Data Collection<\/strong><\/h3>\n<p data-start=\"664\" data-end=\"817\">\u00a0Gathering raw data from various sources like databases, social media, sensors, etc.<br data-start=\"749\" data-end=\"752\" \/><strong data-start=\"755\" data-end=\"767\">Example:<\/strong> E-commerce websites collect user browsing data.<\/p>\n<h3 data-start=\"819\" data-end=\"862\"><strong data-start=\"823\" data-end=\"860\">\u00a0Data Cleaning &amp; Preprocessing<\/strong><\/h3>\n<p data-start=\"863\" data-end=\"1018\">\u00a0Removing duplicates, handling missing values, and formatting data.<br data-start=\"931\" data-end=\"934\" \/><strong data-start=\"937\" data-end=\"949\">Example:<\/strong> If a dataset has missing customer details, we fill or remove them.<\/p>\n<h3 data-start=\"1020\" data-end=\"1073\"><strong data-start=\"1024\" data-end=\"1071\">\u00a0Data Transformation &amp; Feature Selection<\/strong><\/h3>\n<p data-start=\"1074\" data-end=\"1231\">\u00a0Converting data into a usable format and selecting key variables.<br data-start=\"1141\" data-end=\"1144\" \/><strong data-start=\"1147\" data-end=\"1159\">Example:<\/strong> Converting text reviews into numerical scores for sentiment analysis.<\/p>\n<h3 data-start=\"1233\" data-end=\"1281\"><strong data-start=\"1237\" data-end=\"1279\">\u00a0Pattern Discovery &amp; Model Building<\/strong><\/h3>\n<p data-start=\"1282\" data-end=\"1439\">\u00a0Using machine learning, clustering, classification, and association rule mining.<br data-start=\"1364\" data-end=\"1367\" \/><strong data-start=\"1370\" data-end=\"1382\">Example:<\/strong> Netflix recommends movies based on your watch history.<\/p>\n<h3 data-start=\"1441\" data-end=\"1482\"><strong data-start=\"1445\" data-end=\"1480\">\u00a0Evaluation &amp; Interpretation<\/strong><\/h3>\n<p data-start=\"1483\" data-end=\"1639\">\u00a0Checking model accuracy and extracting meaningful insights.<br data-start=\"1544\" data-end=\"1547\" \/><strong data-start=\"1550\" data-end=\"1562\">Example:<\/strong> A bank detects fraudulent transactions based on unusual spending behavior.<\/p>\n<h3 data-start=\"1646\" data-end=\"1693\"><strong data-start=\"1649\" data-end=\"1691\">\u00a0Real-Life Examples &amp; Visualizations<\/strong><\/h3>\n<h3 data-start=\"1695\" data-end=\"1759\"><strong data-start=\"1699\" data-end=\"1757\">\u00a01. Market Basket Analysis (Association Rule Mining)<\/strong><\/h3>\n<p data-start=\"1760\" data-end=\"1904\">\u00a0Used in <strong data-start=\"1770\" data-end=\"1793\">retail &amp; e-commerce<\/strong> to find customer buying patterns.<br data-start=\"1827\" data-end=\"1830\" \/><strong data-start=\"1833\" data-end=\"1845\">Example:<\/strong> If a customer buys bread, they are likely to buy butter.<\/p>\n<p data-start=\"1906\" data-end=\"2014\"><strong data-start=\"1909\" data-end=\"1927\">Visualization:<\/strong><br data-start=\"1927\" data-end=\"1930\" \/>Imagine a heatmap showing <strong data-start=\"1956\" data-end=\"1992\">frequently bought-together items<\/strong> in an online store.<\/p>\n<h3 data-start=\"2021\" data-end=\"2071\"><strong data-start=\"2025\" data-end=\"2069\">\u00a02. Customer Segmentation (Clustering)<\/strong><\/h3>\n<p data-start=\"2072\" data-end=\"2225\">\u00a0Used in <strong data-start=\"2082\" data-end=\"2095\">marketing<\/strong> to group customers based on behavior.<br data-start=\"2133\" data-end=\"2136\" \/><strong data-start=\"2139\" data-end=\"2151\">Example:<\/strong> Companies group customers based on age, location, and purchase history.<\/p>\n<p data-start=\"2227\" data-end=\"2331\"><strong data-start=\"2230\" data-end=\"2248\">Visualization:<\/strong><br data-start=\"2248\" data-end=\"2251\" \/>A scatter plot where different colored clusters represent <strong data-start=\"2309\" data-end=\"2328\">customer groups<\/strong>.<\/p>\n<h3 data-start=\"2338\" data-end=\"2406\"><strong data-start=\"2342\" data-end=\"2404\">\u00a03. Fraud Detection (Classification &amp; Anomaly Detection)<\/strong><\/h3>\n<p data-start=\"2407\" data-end=\"2533\">\u00a0Used in <strong data-start=\"2417\" data-end=\"2444\">banking &amp; cybersecurity<\/strong> to detect fraud.<br data-start=\"2461\" data-end=\"2464\" \/><strong data-start=\"2467\" data-end=\"2479\">Example:<\/strong> A system flags suspicious credit card transactions.<\/p>\n<p data-start=\"2535\" data-end=\"2635\"><strong data-start=\"2538\" data-end=\"2556\">Visualization:<\/strong><br data-start=\"2556\" data-end=\"2559\" \/>A time-series graph showing <strong data-start=\"2587\" data-end=\"2632\">normal transactions vs. fraudulent spikes<\/strong>.<\/p>\n<h3 data-start=\"2642\" data-end=\"2690\"><strong data-start=\"2646\" data-end=\"2688\">\u00a04. Sentiment Analysis (Text Mining)<\/strong><\/h3>\n<p data-start=\"2691\" data-end=\"2820\">\u00a0Used in <strong data-start=\"2701\" data-end=\"2746\">social media &amp; customer feedback analysis<\/strong>.<br data-start=\"2747\" data-end=\"2750\" \/><strong data-start=\"2753\" data-end=\"2765\">Example:<\/strong> Brands analyze customer reviews to improve products.<\/p>\n<p data-start=\"2822\" data-end=\"2915\"><strong data-start=\"2825\" data-end=\"2843\">Visualization:<\/strong><br data-start=\"2843\" data-end=\"2846\" \/>A word cloud showing <strong data-start=\"2867\" data-end=\"2901\">positive vs. negative keywords<\/strong> in reviews.<\/p>\n<h3 data-start=\"2922\" data-end=\"2974\"><strong data-start=\"2925\" data-end=\"2972\">\u00a0Tools &amp; Technologies Used in Data Mining<\/strong><\/h3>\n<p data-start=\"2975\" data-end=\"3254\"><strong data-start=\"2977\" data-end=\"2991\">Python &amp; R<\/strong> \u2013 For data analysis and visualization<br data-start=\"3029\" data-end=\"3032\" \/><strong data-start=\"3034\" data-end=\"3049\">SQL &amp; NoSQL<\/strong> \u2013 For database management<br data-start=\"3075\" data-end=\"3078\" \/><strong data-start=\"3080\" data-end=\"3127\">Machine Learning (Scikit-learn, TensorFlow)<\/strong> \u2013 For predictive modeling<br data-start=\"3153\" data-end=\"3156\" \/><strong data-start=\"3158\" data-end=\"3180\">Tableau &amp; Power BI<\/strong> \u2013 For data visualization<br data-start=\"3205\" data-end=\"3208\" \/><strong data-start=\"3210\" data-end=\"3228\">Hadoop &amp; Spark<\/strong> \u2013 For handling big data<\/p>\n<h3 data-start=\"3261\" data-end=\"3302\"><strong data-start=\"3264\" data-end=\"3300\">\u00a0Why is Data Mining Important?<\/strong><\/h3>\n<p data-start=\"3303\" data-end=\"3493\">\u00a0Helps in <strong data-start=\"3314\" data-end=\"3340\">better decision-making<\/strong><br data-start=\"3340\" data-end=\"3343\" \/>\u00a0Increases <strong data-start=\"3355\" data-end=\"3388\">business efficiency &amp; profits<\/strong><br data-start=\"3388\" data-end=\"3391\" \/>\u00a0Detects <strong data-start=\"3401\" data-end=\"3431\">fraud and security threats<\/strong><br data-start=\"3431\" data-end=\"3434\" \/>\u00a0Improves <strong data-start=\"3445\" data-end=\"3491\">customer experience &amp; marketing strategies<\/strong><\/p>\n<h3 data-start=\"3500\" data-end=\"3526\"><strong data-start=\"3503\" data-end=\"3524\">\u00a0Final Thoughts<\/strong><\/h3>\n<p data-start=\"3527\" data-end=\"3706\"><strong data-start=\"3530\" data-end=\"3575\">Data Mining = Turning Raw Data into GOLD!<\/strong><br data-start=\"3575\" data-end=\"3578\" \/>It is one of the most powerful tools for businesses and researchers to <strong data-start=\"3649\" data-end=\"3675\">unlock hidden insights<\/strong> and make informed decisions.<\/p>\n<p data-start=\"3708\" data-end=\"3804\" data-is-last-node=\"\" data-is-only-node=\"\">\u00a0<strong data-start=\"3711\" data-end=\"3804\" data-is-last-node=\"\">Want to see real visualizations? Let me know what dataset or use case interests you!<\/strong><\/p>\n<h3 data-start=\"3708\" data-end=\"3804\"><a href=\"https:\/\/www.vssut.ac.in\/lecture_notes\/lecture1428550844.pdf\" target=\"_blank\" rel=\"noopener\">DATA SCIENCE: what is data mining ( Complete understanding ) with real visualisation<\/a><\/h3>\n<h3 class=\"LC20lb MBeuO DKV0Md\"><a href=\"https:\/\/www.ceom.ou.edu\/media\/docs\/upload\/Pang-Ning_Tan_Michael_Steinbach_Vipin_Kumar_-_Introduction_to_Data_Mining-Pe_NRDK4fi.pdf\" target=\"_blank\" rel=\"noopener\">Introduction to Data Mining<\/a><\/h3>\n<h3 class=\"LC20lb MBeuO DKV0Md\"><a href=\"https:\/\/ai.stanford.edu\/~ronnyk\/naeMining.pdf\" target=\"_blank\" rel=\"noopener\">Data Mining and Visualization<\/a><\/h3>\n<h3 data-start=\"0\" data-end=\"105\">\ud83d\udcca <strong data-start=\"7\" data-end=\"105\">Complete Understanding of Data Mining in Data Science (With Real-World Visualization Examples)<\/strong><\/h3>\n<hr data-start=\"107\" data-end=\"110\" \/>\n<h2 data-start=\"112\" data-end=\"145\">\ud83e\udde0 <strong data-start=\"118\" data-end=\"145\">1. What is Data Mining?<\/strong><\/h2>\n<p data-start=\"147\" data-end=\"348\"><strong data-start=\"147\" data-end=\"162\">Data Mining<\/strong> is the process of <strong data-start=\"181\" data-end=\"257\">discovering hidden patterns, trends, correlations, or useful information<\/strong> from <strong data-start=\"263\" data-end=\"281\">large datasets<\/strong> using statistical methods, machine learning, and database systems.<\/p>\n<blockquote data-start=\"350\" data-end=\"436\">\n<p data-start=\"352\" data-end=\"436\">\ud83d\udd0d In simple terms: <strong data-start=\"372\" data-end=\"436\">Data Mining = Finding Gold (Insights) in a Mountain of Data.<\/strong><\/p>\n<\/blockquote>\n<hr data-start=\"438\" data-end=\"441\" \/>\n<h2 data-start=\"443\" data-end=\"502\">\ud83d\udee0\ufe0f <strong data-start=\"450\" data-end=\"502\">2. Why is Data Mining Important in Data Science?<\/strong><\/h2>\n<p data-start=\"504\" data-end=\"542\">In Data Science, data mining helps to:<\/p>\n<ul data-start=\"543\" data-end=\"687\">\n<li data-start=\"543\" data-end=\"571\">\n<p data-start=\"545\" data-end=\"571\">Make data-driven decisions<\/p>\n<\/li>\n<li data-start=\"572\" data-end=\"602\">\n<p data-start=\"574\" data-end=\"602\">Discover patterns and trends<\/p>\n<\/li>\n<li data-start=\"603\" data-end=\"628\">\n<p data-start=\"605\" data-end=\"628\">Predict future outcomes<\/p>\n<\/li>\n<li data-start=\"629\" data-end=\"656\">\n<p data-start=\"631\" data-end=\"656\">Detect fraud or anomalies<\/p>\n<\/li>\n<li data-start=\"657\" data-end=\"687\">\n<p data-start=\"659\" data-end=\"687\">Understand customer behavior<\/p>\n<\/li>\n<\/ul>\n<hr data-start=\"689\" data-end=\"692\" \/>\n<h2 data-start=\"694\" data-end=\"726\">\ud83d\udd04 <strong data-start=\"700\" data-end=\"726\">3. Data Mining Process<\/strong><\/h2>\n<p data-start=\"728\" data-end=\"764\">Here&#8217;s a step-by-step visualization:<\/p>\n<div class=\"contain-inline-size rounded-md border-[0.5px] border-token-border-medium relative bg-token-sidebar-surface-primary\">\n<div class=\"flex items-center text-token-text-secondary px-4 py-2 text-xs font-sans justify-between h-9 bg-token-sidebar-surface-primary dark:bg-token-main-surface-secondary select-none rounded-t-[5px]\">mathematica<\/div>\n<div class=\"sticky top-9\">\n<div class=\"absolute end-0 bottom-0 flex h-9 items-center pe-2\">\n<div class=\"bg-token-sidebar-surface-primary text-token-text-secondary dark:bg-token-main-surface-secondary flex items-center rounded-sm px-2 font-sans text-xs\"><button class=\"flex gap-1 items-center select-none px-4 py-1\" aria-label=\"Copy\">Copy<\/button><span class=\"\" data-state=\"closed\"><button class=\"flex items-center gap-1 px-4 py-1 select-none\">Edit<\/button><\/span><\/div>\n<\/div>\n<\/div>\n<div class=\"overflow-y-auto p-4\" dir=\"ltr\"><code class=\"whitespace-pre!\">\ud83d\udce5 <span class=\"hljs-variable\">Data<\/span> <span class=\"hljs-variable\">Collection<\/span>\n        \u2193\n\ud83e\uddf9 <span class=\"hljs-variable\">Data<\/span> <span class=\"hljs-variable\">Cleaning<\/span>\n        \u2193\n\ud83d\udcca <span class=\"hljs-variable\">Data<\/span> <span class=\"hljs-variable\">Integration<\/span>\n        \u2193\n\ud83d\udd0d <span class=\"hljs-variable\">Data<\/span> <span class=\"hljs-built_in\">Selection<\/span>\n        \u2193\n\u2699\ufe0f <span class=\"hljs-variable\">Data<\/span> <span class=\"hljs-variable\">Transformation<\/span>\n        \u2193\n\ud83e\udde0 <span class=\"hljs-variable\">Data<\/span> <span class=\"hljs-variable\">Mining<\/span>\n        \u2193\n\ud83d\udcc8 <span class=\"hljs-built_in\">Pattern<\/span> <span class=\"hljs-variable\">Evaluation<\/span>\n        \u2193\n\ud83d\udce4 <span class=\"hljs-variable\">Knowledge<\/span> <span class=\"hljs-variable\">Presentation<\/span>\n<\/code><\/div>\n<\/div>\n<hr data-start=\"1005\" data-end=\"1008\" \/>\n<h2 data-start=\"1010\" data-end=\"1057\">\ud83c\udfaf <strong data-start=\"1016\" data-end=\"1057\">4. Real-Life Visualization &amp; Examples<\/strong><\/h2>\n<h3 data-start=\"1059\" data-end=\"1104\">\ud83d\uded2 <strong data-start=\"1066\" data-end=\"1104\">A. Market Basket Analysis (Retail)<\/strong><\/h3>\n<p data-start=\"1106\" data-end=\"1170\"><strong data-start=\"1106\" data-end=\"1115\">Goal:<\/strong> Understand which items are frequently bought together.<\/p>\n<p data-start=\"1172\" data-end=\"1190\"><strong data-start=\"1172\" data-end=\"1190\">Visualization:<\/strong><\/p>\n<div class=\"contain-inline-size rounded-md border-[0.5px] border-token-border-medium relative bg-token-sidebar-surface-primary\">\n<div class=\"flex items-center text-token-text-secondary px-4 py-2 text-xs font-sans justify-between h-9 bg-token-sidebar-surface-primary dark:bg-token-main-surface-secondary select-none rounded-t-[5px]\">markdown<\/div>\n<div class=\"sticky top-9\">\n<div class=\"absolute end-0 bottom-0 flex h-9 items-center pe-2\">\n<div class=\"bg-token-sidebar-surface-primary text-token-text-secondary dark:bg-token-main-surface-secondary flex items-center rounded-sm px-2 font-sans text-xs\"><button class=\"flex gap-1 items-center select-none px-4 py-1\" aria-label=\"Copy\">Copy<\/button><span class=\"\" data-state=\"closed\"><button class=\"flex items-center gap-1 px-4 py-1 select-none\">Edit<\/button><\/span><\/div>\n<\/div>\n<\/div>\n<div class=\"overflow-y-auto p-4\" dir=\"ltr\"><code class=\"whitespace-pre!\">[Customer Buys] \u2192 {Bread, Milk}\n<span class=\"hljs-code\">                \u2192 {Bread, Diaper, Beer, Eggs}\n                \u2192 {Milk, Diaper, Beer, Cola}\n                \u2192 {Bread, Milk, Diaper, Beer}\n<\/span>\n\ud83e\udde0 Data Mining Rule Discovered:\n\"If customer buys Diaper \u2192 then likely to buy Beer\"\n\n\ud83d\udcc8 Used for:\n<span class=\"hljs-bullet\">-<\/span> Product placement\n<span class=\"hljs-bullet\">-<\/span> Targeted marketing\n<\/code><\/div>\n<\/div>\n<hr data-start=\"1509\" data-end=\"1512\" \/>\n<h3 data-start=\"1514\" data-end=\"1565\">\ud83c\udfe6 <strong data-start=\"1521\" data-end=\"1565\">B. Credit Card Fraud Detection (Banking)<\/strong><\/h3>\n<p data-start=\"1567\" data-end=\"1634\"><strong data-start=\"1567\" data-end=\"1576\">Goal:<\/strong> Detect suspicious transactions using pattern recognition.<\/p>\n<p data-start=\"1636\" data-end=\"1654\"><strong data-start=\"1636\" data-end=\"1654\">Visualization:<\/strong><\/p>\n<div class=\"contain-inline-size rounded-md border-[0.5px] border-token-border-medium relative bg-token-sidebar-surface-primary\">\n<div class=\"flex items-center text-token-text-secondary px-4 py-2 text-xs font-sans justify-between h-9 bg-token-sidebar-surface-primary dark:bg-token-main-surface-secondary select-none rounded-t-[5px]\">yaml<\/div>\n<div class=\"sticky top-9\">\n<div class=\"absolute end-0 bottom-0 flex h-9 items-center pe-2\">\n<div class=\"bg-token-sidebar-surface-primary text-token-text-secondary dark:bg-token-main-surface-secondary flex items-center rounded-sm px-2 font-sans text-xs\"><button class=\"flex gap-1 items-center select-none px-4 py-1\" aria-label=\"Copy\">Copy<\/button><span class=\"\" data-state=\"closed\"><button class=\"flex items-center gap-1 px-4 py-1 select-none\">Edit<\/button><\/span><\/div>\n<\/div>\n<\/div>\n<div class=\"overflow-y-auto p-4\" dir=\"ltr\"><code class=\"whitespace-pre!\"><span class=\"hljs-attr\">Normal Transaction:<\/span> \n  <span class=\"hljs-bullet\">-<\/span> <span class=\"hljs-attr\">Location:<\/span> <span class=\"hljs-string\">Home<\/span> <span class=\"hljs-string\">City<\/span>\n  <span class=\"hljs-bullet\">-<\/span> <span class=\"hljs-attr\">Amount:<\/span> <span class=\"hljs-string\">$30\u2013$200<\/span>\n\n<span class=\"hljs-attr\">Suspicious Transaction:<\/span>\n  <span class=\"hljs-bullet\">-<\/span> <span class=\"hljs-attr\">Location:<\/span> <span class=\"hljs-string\">Abroad<\/span>\n  <span class=\"hljs-bullet\">-<\/span> <span class=\"hljs-attr\">Amount:<\/span> <span class=\"hljs-string\">$5000<\/span> <span class=\"hljs-string\">at<\/span> <span class=\"hljs-number\">3<\/span> <span class=\"hljs-string\">AM<\/span>\n\n<span class=\"hljs-string\">\ud83d\udcc9<\/span> <span class=\"hljs-string\">Outlier<\/span> <span class=\"hljs-string\">Detected<\/span> <span class=\"hljs-string\">\u2192<\/span> <span class=\"hljs-string\">Flag<\/span> <span class=\"hljs-string\">for<\/span> <span class=\"hljs-string\">Review<\/span> <span class=\"hljs-string\">\ud83d\udea8<\/span>\n<\/code><\/div>\n<\/div>\n<hr data-start=\"1844\" data-end=\"1847\" \/>\n<h3 data-start=\"1849\" data-end=\"1896\">\ud83d\udcac <strong data-start=\"1856\" data-end=\"1896\">C. Sentiment Analysis (Social Media)<\/strong><\/h3>\n<p data-start=\"1898\" data-end=\"1957\"><strong data-start=\"1898\" data-end=\"1907\">Goal:<\/strong> Understand public opinion by mining tweets\/posts.<\/p>\n<p data-start=\"1959\" data-end=\"1977\"><strong data-start=\"1959\" data-end=\"1977\">Visualization:<\/strong><\/p>\n<div class=\"contain-inline-size rounded-md border-[0.5px] border-token-border-medium relative bg-token-sidebar-surface-primary\">\n<div class=\"flex items-center text-token-text-secondary px-4 py-2 text-xs font-sans justify-between h-9 bg-token-sidebar-surface-primary dark:bg-token-main-surface-secondary select-none rounded-t-[5px]\">mathematica<\/div>\n<div class=\"sticky top-9\">\n<div class=\"absolute end-0 bottom-0 flex h-9 items-center pe-2\">\n<div class=\"bg-token-sidebar-surface-primary text-token-text-secondary dark:bg-token-main-surface-secondary flex items-center rounded-sm px-2 font-sans text-xs\"><button class=\"flex gap-1 items-center select-none px-4 py-1\" aria-label=\"Copy\">Copy<\/button><span class=\"\" data-state=\"closed\"><button class=\"flex items-center gap-1 px-4 py-1 select-none\">Edit<\/button><\/span><\/div>\n<\/div>\n<\/div>\n<div class=\"overflow-y-auto p-4\" dir=\"ltr\"><code class=\"whitespace-pre!\"><span class=\"hljs-string\">\"Product X is amazing!\"<\/span> \u2192 <span class=\"hljs-built_in\">Positive<\/span> \ud83d\ude0a\n<span class=\"hljs-string\">\"Totally disappointed with Product Y.\"<\/span> \u2192 <span class=\"hljs-built_in\">Negative<\/span> \ud83d\ude21\n\n\ud83d\udcca <span class=\"hljs-variable\">Visual<\/span> <span class=\"hljs-variable\">Output<\/span><span class=\"hljs-operator\">:<\/span>\n<span class=\"hljs-built_in\">Positive<\/span><span class=\"hljs-operator\">:<\/span> <span class=\"hljs-number\">72<\/span><span class=\"hljs-operator\">%<\/span>\n<span class=\"hljs-built_in\">Negative<\/span><span class=\"hljs-operator\">:<\/span> <span class=\"hljs-number\">18<\/span><span class=\"hljs-operator\">%<\/span>\n<span class=\"hljs-variable\">Neutral<\/span><span class=\"hljs-operator\">:<\/span> <span class=\"hljs-number\">10<\/span><span class=\"hljs-operator\">%<\/span>\n<\/code><\/div>\n<\/div>\n<hr data-start=\"2138\" data-end=\"2141\" \/>\n<h3 data-start=\"2143\" data-end=\"2188\">\ud83c\udfe5 <strong data-start=\"2150\" data-end=\"2188\">D. Disease Prediction (Healthcare)<\/strong><\/h3>\n<p data-start=\"2190\" data-end=\"2242\"><strong data-start=\"2190\" data-end=\"2199\">Goal:<\/strong> Predict disease risk from patient records.<\/p>\n<p data-start=\"2244\" data-end=\"2262\"><strong data-start=\"2244\" data-end=\"2262\">Visualization:<\/strong><\/p>\n<div class=\"contain-inline-size rounded-md border-[0.5px] border-token-border-medium relative bg-token-sidebar-surface-primary\">\n<div class=\"flex items-center text-token-text-secondary px-4 py-2 text-xs font-sans justify-between h-9 bg-token-sidebar-surface-primary dark:bg-token-main-surface-secondary select-none rounded-t-[5px]\">diff<\/div>\n<div class=\"sticky top-9\">\n<div class=\"absolute end-0 bottom-0 flex h-9 items-center pe-2\">\n<div class=\"bg-token-sidebar-surface-primary text-token-text-secondary dark:bg-token-main-surface-secondary flex items-center rounded-sm px-2 font-sans text-xs\"><button class=\"flex gap-1 items-center select-none px-4 py-1\" aria-label=\"Copy\">Copy<\/button><span class=\"\" data-state=\"closed\"><button class=\"flex items-center gap-1 px-4 py-1 select-none\">Edit<\/button><\/span><\/div>\n<\/div>\n<\/div>\n<div class=\"overflow-y-auto p-4\" dir=\"ltr\"><code class=\"whitespace-pre!\">Input:\n<span class=\"hljs-deletion\">- Age<\/span>\n<span class=\"hljs-deletion\">- BMI<\/span>\n<span class=\"hljs-deletion\">- Blood Pressure<\/span>\n<span class=\"hljs-deletion\">- Glucose Levels<\/span>\n\nModel:\n<span class=\"hljs-deletion\">- Uses past patient data (Training)<\/span>\n\nOutput:\n<span class=\"hljs-deletion\">- 85% probability of diabetes<\/span>\n\n\ud83c\udfe5 Action: Schedule preventive care\n<\/code><\/div>\n<\/div>\n<hr data-start=\"2445\" data-end=\"2448\" \/>\n<h2 data-start=\"2450\" data-end=\"2492\">\ud83e\uddf0 <strong data-start=\"2456\" data-end=\"2492\">5. Common Data Mining Techniques<\/strong><\/h2>\n<div class=\"_tableContainer_16hzy_1\">\n<div class=\"_tableWrapper_16hzy_14 group flex w-fit flex-col-reverse\">\n<table class=\"w-fit min-w-(--thread-content-width)\" data-start=\"2494\" data-end=\"3421\">\n<thead data-start=\"2494\" data-end=\"2609\">\n<tr data-start=\"2494\" data-end=\"2609\">\n<th data-start=\"2494\" data-end=\"2519\" data-col-size=\"sm\">Technique<\/th>\n<th data-start=\"2519\" data-end=\"2566\" data-col-size=\"sm\">Purpose<\/th>\n<th data-start=\"2566\" data-end=\"2609\" data-col-size=\"sm\">Example<\/th>\n<\/tr>\n<\/thead>\n<tbody data-start=\"2726\" data-end=\"3421\">\n<tr data-start=\"2726\" data-end=\"2841\">\n<td data-start=\"2726\" data-end=\"2751\" data-col-size=\"sm\"><strong data-start=\"2728\" data-end=\"2746\">Classification<\/strong><\/td>\n<td data-col-size=\"sm\" data-start=\"2751\" data-end=\"2798\">Assign category labels<\/td>\n<td data-col-size=\"sm\" data-start=\"2798\" data-end=\"2841\">Spam vs. Non-Spam Emails<\/td>\n<\/tr>\n<tr data-start=\"2842\" data-end=\"2957\">\n<td data-start=\"2842\" data-end=\"2867\" data-col-size=\"sm\"><strong data-start=\"2844\" data-end=\"2858\">Clustering<\/strong><\/td>\n<td data-col-size=\"sm\" data-start=\"2867\" data-end=\"2914\">Group similar data without labels<\/td>\n<td data-col-size=\"sm\" data-start=\"2914\" data-end=\"2957\">Customer Segmentation<\/td>\n<\/tr>\n<tr data-start=\"2958\" data-end=\"3073\">\n<td data-start=\"2958\" data-end=\"2983\" data-col-size=\"sm\"><strong data-start=\"2960\" data-end=\"2981\">Association Rules<\/strong><\/td>\n<td data-col-size=\"sm\" data-start=\"2983\" data-end=\"3030\">Discover co-occurring items<\/td>\n<td data-col-size=\"sm\" data-start=\"3030\" data-end=\"3073\">Market Basket Analysis<\/td>\n<\/tr>\n<tr data-start=\"3074\" data-end=\"3189\">\n<td data-start=\"3074\" data-end=\"3099\" data-col-size=\"sm\"><strong data-start=\"3076\" data-end=\"3097\">Anomaly Detection<\/strong><\/td>\n<td data-col-size=\"sm\" data-start=\"3099\" data-end=\"3146\">Identify outliers or fraud<\/td>\n<td data-col-size=\"sm\" data-start=\"3146\" data-end=\"3189\">Credit Card Fraud<\/td>\n<\/tr>\n<tr data-start=\"3190\" data-end=\"3305\">\n<td data-start=\"3190\" data-end=\"3215\" data-col-size=\"sm\"><strong data-start=\"3192\" data-end=\"3206\">Regression<\/strong><\/td>\n<td data-col-size=\"sm\" data-start=\"3215\" data-end=\"3262\">Predict numerical values<\/td>\n<td data-col-size=\"sm\" data-start=\"3262\" data-end=\"3305\">House Price Prediction<\/td>\n<\/tr>\n<tr data-start=\"3306\" data-end=\"3421\">\n<td data-start=\"3306\" data-end=\"3331\" data-col-size=\"sm\"><strong data-start=\"3308\" data-end=\"3323\">Text Mining<\/strong><\/td>\n<td data-col-size=\"sm\" data-start=\"3331\" data-end=\"3378\">Extract insights from text<\/td>\n<td data-col-size=\"sm\" data-start=\"3378\" data-end=\"3421\">Sentiment Analysis on Reviews<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<div class=\"sticky end-(--thread-content-margin) h-0 self-end select-none\">\n<div class=\"absolute end-0 flex items-end\">\u00a0<\/div>\n<\/div>\n<\/div>\n<\/div>\n<hr data-start=\"3423\" data-end=\"3426\" \/>\n<h2 data-start=\"3428\" data-end=\"3466\">\ud83d\udcd8 <strong data-start=\"3434\" data-end=\"3466\">6. Tools Used in Data Mining<\/strong><\/h2>\n<ul data-start=\"3468\" data-end=\"3629\">\n<li data-start=\"3468\" data-end=\"3503\">\n<p data-start=\"3470\" data-end=\"3503\"><strong data-start=\"3470\" data-end=\"3503\">Python (pandas, scikit-learn)<\/strong><\/p>\n<\/li>\n<li data-start=\"3504\" data-end=\"3526\">\n<p data-start=\"3506\" data-end=\"3526\"><strong data-start=\"3506\" data-end=\"3526\">R (caret, rpart)<\/strong><\/p>\n<\/li>\n<li data-start=\"3527\" data-end=\"3543\">\n<p data-start=\"3529\" data-end=\"3543\"><strong data-start=\"3529\" data-end=\"3543\">RapidMiner<\/strong><\/p>\n<\/li>\n<li data-start=\"3544\" data-end=\"3554\">\n<p data-start=\"3546\" data-end=\"3554\"><strong data-start=\"3546\" data-end=\"3554\">WEKA<\/strong><\/p>\n<\/li>\n<li data-start=\"3555\" data-end=\"3599\">\n<p data-start=\"3557\" data-end=\"3599\"><strong data-start=\"3557\" data-end=\"3599\">Tableau \/ Power BI (for visualization)<\/strong><\/p>\n<\/li>\n<li data-start=\"3600\" data-end=\"3629\">\n<p data-start=\"3602\" data-end=\"3629\"><strong data-start=\"3602\" data-end=\"3629\">SQL (for data querying)<\/strong><\/p>\n<\/li>\n<\/ul>\n<hr data-start=\"3631\" data-end=\"3634\" \/>\n<h2 data-start=\"3636\" data-end=\"3649\">\ud83d\udcda Summary<\/h2>\n<div class=\"_tableContainer_16hzy_1\">\n<div class=\"_tableWrapper_16hzy_14 group flex w-fit flex-col-reverse\">\n<table class=\"w-fit min-w-(--thread-content-width)\" data-start=\"3651\" data-end=\"4180\">\n<thead data-start=\"3651\" data-end=\"3726\">\n<tr data-start=\"3651\" data-end=\"3726\">\n<th data-start=\"3651\" data-end=\"3674\" data-col-size=\"sm\">Feature<\/th>\n<th data-start=\"3674\" data-end=\"3726\" data-col-size=\"md\">Description<\/th>\n<\/tr>\n<\/thead>\n<tbody data-start=\"3803\" data-end=\"4180\">\n<tr data-start=\"3803\" data-end=\"3878\">\n<td data-start=\"3803\" data-end=\"3826\" data-col-size=\"sm\">Definition<\/td>\n<td data-col-size=\"md\" data-start=\"3826\" data-end=\"3878\">Finding patterns from large data sets<\/td>\n<\/tr>\n<tr data-start=\"3879\" data-end=\"3954\">\n<td data-start=\"3879\" data-end=\"3902\" data-col-size=\"sm\">Goal<\/td>\n<td data-col-size=\"md\" data-start=\"3902\" data-end=\"3954\">Extract useful knowledge<\/td>\n<\/tr>\n<tr data-start=\"3955\" data-end=\"4029\">\n<td data-start=\"3955\" data-end=\"3978\" data-col-size=\"sm\">Techniques Used<\/td>\n<td data-col-size=\"md\" data-start=\"3978\" data-end=\"4029\">Classification, Clustering, Association, etc.<\/td>\n<\/tr>\n<tr data-start=\"4030\" data-end=\"4104\">\n<td data-start=\"4030\" data-end=\"4053\" data-col-size=\"sm\">Application Fields<\/td>\n<td data-col-size=\"md\" data-start=\"4053\" data-end=\"4104\">Retail, Finance, Healthcare, Social Media<\/td>\n<\/tr>\n<tr data-start=\"4105\" data-end=\"4180\">\n<td data-start=\"4105\" data-end=\"4128\" data-col-size=\"sm\">Outcome<\/td>\n<td data-col-size=\"md\" data-start=\"4128\" data-end=\"4180\">Actionable insights for business or research<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<div class=\"sticky end-(--thread-content-margin) h-0 self-end select-none\">\n<div class=\"absolute end-0 flex items-end\">\u00a0<\/div>\n<\/div>\n<\/div>\n<\/div>\n<hr data-start=\"4182\" data-end=\"4185\" \/>\n<p data-start=\"4187\" data-end=\"4310\" data-is-last-node=\"\" data-is-only-node=\"\">Would you like a visual infographic or a real dashboard example (e.g. Power BI \/ Tableau format) to understand this better?<\/p>\n<h3 data-start=\"4187\" data-end=\"4310\"><a href=\"https:\/\/nriit.edu.in\/files\/IT-Notes\/DWDM\/UNIT-2.pdf\" target=\"_blank\" rel=\"noopener\">DATA SCIENCE: what is data mining ( Complete understanding ) with real visualisation<\/a><\/h3>\n<h3 class=\"LC20lb MBeuO DKV0Md\"><a href=\"https:\/\/myweb.sabanciuniv.edu\/rdehkharghani\/files\/2016\/02\/The-Morgan-Kaufmann-Series-in-Data-Management-Systems-Jiawei-Han-Micheline-Kamber-Jian-Pei-Data-Mining.-Concepts-and-Techniques-3rd-Edition-Morgan-Kaufmann-2011.pdf\" target=\"_blank\" rel=\"noopener\">Data Mining. Concepts and Techniques, 3rd Edition (The &#8230;<\/a><\/h3>\n<h3 class=\"LC20lb MBeuO DKV0Md\"><a href=\"https:\/\/sriindu.ac.in\/wp-content\/uploads\/2023\/10\/R18CSE4102-Data-Mining.pdf\" target=\"_blank\" rel=\"noopener\">R18CSE4102-Data-Mining.pdf<\/a><\/h3>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>DATA SCIENCE: what is data mining ( Complete understanding ) with real visualisation<\/p>\n","protected":false},"author":64,"featured_media":2377,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[133,1368],"tags":[],"class_list":["post-2379","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-computer-science","category-seo"],"_links":{"self":[{"href":"https:\/\/www.reilsolar.com\/pdf\/wp-json\/wp\/v2\/posts\/2379","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.reilsolar.com\/pdf\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.reilsolar.com\/pdf\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.reilsolar.com\/pdf\/wp-json\/wp\/v2\/users\/64"}],"replies":[{"embeddable":true,"href":"https:\/\/www.reilsolar.com\/pdf\/wp-json\/wp\/v2\/comments?post=2379"}],"version-history":[{"count":0,"href":"https:\/\/www.reilsolar.com\/pdf\/wp-json\/wp\/v2\/posts\/2379\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.reilsolar.com\/pdf\/wp-json\/wp\/v2\/media\/2377"}],"wp:attachment":[{"href":"https:\/\/www.reilsolar.com\/pdf\/wp-json\/wp\/v2\/media?parent=2379"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.reilsolar.com\/pdf\/wp-json\/wp\/v2\/categories?post=2379"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.reilsolar.com\/pdf\/wp-json\/wp\/v2\/tags?post=2379"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}