دوره 8، شماره 2 - ( 4-1399 )                   جلد 8 شماره 2 صفحات 22-9 | برگشت به فهرست نسخه ها


XML English Abstract Print


Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

Roozbeh M, Maanavi M, Babaie-Kafaki S. Robust high-dimensional semiparametric regression using optimized differencing method applied to the vitamin B2 production data. Iran J Health Sci 2020; 8 (2) :9-22
URL: http://jhs.mazums.ac.ir/article-1-703-fa.html
Robust high-dimensional semiparametric regression using optimized differencing method applied to the vitamin B2 production data. علوم بهداشتی ایران. 1399; 8 (2) :9-22

URL: http://jhs.mazums.ac.ir/article-1-703-fa.html


چکیده:   (2196 مشاهده)
Background and purpose: By evolving science, knowledge, and technology, we deal with high-dimensional data in which the number of predictors may considerably exceed the sample size. The main problems with high-dimensional data are the estimation of the coefficients and interpretation. For high-dimension problems, classical methods are not reliable because of a large number of predictor variables. In addition, classical methods are affected by the presence of outliers and collinearity.
Methods: Nowadays, many real-world data sets carry structures of high-dimensional problems. To handle this problem, we used the least absolute shrinkage and selection operator (LASSO). Also, due to the flexibility and applicability of the semiparametric model in medical data, it can be used for modeling the genomic data. Motivated by these, here an improved robust approach in a high-dimensional data set was developed for the analysis of gene expression and prediction in the presence of outliers.
Results: Among the common problems in regression analysis, there was the problem of outliers. In the regression concept, an outlier is a point that fails to follow the main linear pattern of the data. The ordinary least-squares estimator was found potentially sensitive to the outliers; this fact provided necessary motivations to investigate robust estimations. Generally, the robust regression is among the most popular problems in the statistics community. In the present study, the least trimmed squares (LTS) estimation was applied to overcome the outlier problem.
Conclusions: We have proposed an optimization approach for semiparametric models to combat outliers in the data set. Especially, based on a penalization LASSO scheme, we have suggested a nonlinear integer programming problem as the semiparametric model which can be effectively solved by any evolutionary algorithm. We have also studied a real-world application related to the riboflavin production. The results showed that the proposed method was reasonably efficient in contrast to the LTS Method.
 
متن کامل [PDF 796 kb]   (1048 دریافت)    
نوع مطالعه: پژوهشي | موضوع مقاله: آمار زیستی

ارسال نظر درباره این مقاله : نام کاربری یا پست الکترونیک شما:
CAPTCHA

بازنشر اطلاعات
Creative Commons License این مقاله تحت شرایط Creative Commons Attribution-NonCommercial 4.0 International License قابل بازنشر است.

کلیه حقوق این وب سایت متعلق به می باشد.

طراحی و برنامه نویسی : یکتاوب افزار شرق

© 2024 CC BY-NC 4.0 | Iranian Journal of Health Sciences

Designed & Developed by : Yektaweb