Jump to content

Ukuhlaziya Imininingo

Mayelana Wikipedia

Ukuhlaziya Imininingo kuyinqubo yokuhlola, yokuhlanza, yokuguqula, kanye nokwenza unongo lwemininingo ngenhloso yokuvubukula ukwaziswa okuwusizo, ukwazisa iziphetho, kanye nokusekela ukuthathwa kwezinqumo. [1] Ukuhlaziywa kwemininingo kunezici eziningi, ezihlanganisa ubuchule obunhlobonhlobo ngaphansi kwamagama ahlukene, futhi kuyasetshenziswa kwimihwebo ehlukene, kwezenzululwazi, nasezidlangaleni zenzululwazi yomphakathi. Ezweni lanamuhla lezohwebo, ukuhlaziywa kwemininingo kunendima ekwenzeni izinqumo zibe ezenzululwazi kakhudlwana nasekusizeni imihwebo (businesses) igidinge ngokufanele.

Ukuvukuza imininingo kungubuchule bokuhlaziywa kwemininingo ngokukhethekile obugxila ekunongeni izibalo nokuvubukukwa kolwazi ngezinjongo yokubikezela kunokuchazisa, kuyilapho ubuhlakani bohwebo buhlanganisa nokuhlaziywa kwemininingo encike kakhulu ekubumbutheni (aggregation), ngokugxila kakhulu ekwazisweni kohwebo. Ekusebenzeni kwezibalo, ukuhlaziywa kwemininingo kungahlukaniswa kube izibalo ezichazisayo (descriptive statistics), ukuhlaziya imininingo okuhlwayayo (exploratory data analysis (EDA)), kanye nokuhlaziya imininingo okuqinisekisayo (confirmatory data analysis (CDA)). i-EDA igxila ekuvubukuleni ulwangu olusha emininingweni kuyilapho i-CDA igxila ekuqinisekiseni noma ekumbuluziseni isihlawumbiselo esikhona. Uhlaziyo olubikezelayo lugxila ekusetshenzisweni konongo lezibalo ukuze kubikezelwe sakubhula noma ukujinjwa kokubikezelwa, kuyilapho ukuhlaziywa kwesilobo kusebenza ubuchule bezibalo, ulimi nohlaka ukugqoba nokujinja ukwaziswa emthonjeni yezilobo, okuwuhlobo lemininingo engahluziwe. Konke okungenhla kuyizinhlobo zokuhlaziywa kwemininingo.

Ukudidiyela imininingo kuyisandulela sokuhlaziya imininingo, futhi ukuhlaziywa kwemininingo kuxhumene eduze nomboniso wemininingo kanye nokuchakalaliswa kwemininingo.

Inqubo yokuhlaziya imininingo

[hlela | Hlela umthombo]
Isazibe somdludlungu wokuhlaziya imininingo ovela kwethi Doing Data Science, ka-Schutt no- O'Neil (2013)

Isihlaziyo sisho ukuhlukanisa okuphelele kube yizingcezu ezihlukene ukuze zihlolwe ngayinye.[2] Ukuhlaziya imininingo Umdludlungu wokuthola imininingo engahluziwe, kanye nokuyiguqula kube ukwaziswa okunosizo ekuthatyeni izinqumo.[1] Imininingo iyabuthwa futhi ihlaziywe ukuze kuphendulwe imibuzo, kuvivinywe izihlawumbiselo , noma kufakazelwe imichaciso.[3]

Ngowe-1961, usozibalomniningo uJohn Tukey, wachasisa ukuhlaziya imininingo ngokuthi:

"kuyizinqubo zokuhlaziya imininingo, ubuchule bokuhlakahla iziphumo zaleyonqubo, izindlela zokuhlela ukubuthwa imininingo ukuze ihlaziywe kalula, nangokunembile, nazo zonke izinguxa neziphumo zezibalomniningo ezisebenza ekuhlaziyweni kwemininingo."[4]

Kunezigaba ezimbalwa ezihlukanisekayo, ezichaziswa ngezansi. Lezi zigaba ziyaziphinda, ngokuthi isibikemphinda sezigaba ezisemuva siletha umsebenzi owengeziwe ezigabeni eziphambili.[5] Unongo olusetshenziselwa ukuvukuza imininingo elibizwa CRISP, nalo kunezigaba ezifanayo.

Imithombo

[hlela | Hlela umthombo]
  1. 1.0 1.1 "Transforming Unstructured Data into Useful Information", Big Data, Mining, and Analytics (Auerbach Publications): pp. 227–246, 2014-03-12, doi:10.1201/b16666-14, ISBN 978-0-429-09529-0, http://dx.doi.org/10.1201/b16666-14, retrieved 2021-05-29 
  2. Field, John (2009), "Dividing listening into its components", Listening in the Language Classroom (Cambridge: Cambridge University Press): pp. 96–109, doi:10.1017/cbo9780511575945.008, ISBN 978-0-511-57594-5, http://dx.doi.org/10.1017/cbo9780511575945.008, retrieved 2021-05-29 
  3. Judd, Charles; McCleland, Gary (1989). Data Analysis. Harcourt Brace Jovanovich. ISBN 0-15-516765-0.
  4. Tukey, John W. (March 1962). "John Tukey-The Future of Data Analysis-July 1961". The Annals of Mathematical Statistics 33 (1): 1–67. doi:10.1214/aoms/1177704711. http://projecteuclid.org/download/pdf_1/euclid.aoms/1177704711. Retrieved 2015-01-01. 
  5. Schutt, Rachel; O'Neil, Cathy (2013). Doing Data Science. O'Reilly Media. ISBN 978-1-449-35865-5.