Tabular: Added XGBoost Model#691
Conversation
Job PR-691-1 is done.
This looks awesome! Really nicely put together. I will take a deeper look tomorrow and provide review feedback, as well as pull this into my local machine and test it on some datasets.
Innixma
left a comment
Added some initial comments. We are in code-freeze at present as we are working on a major modularization PR. This should be complete in 2 weeks at which point we can rebase this PR with mainline to prepare for merging. My initial basic tests and runs look good and I will plan to benchmark it more heavily after modularization is merged into mainline.
Thanks again for the high quality PR!
autogluon/utils/tabular/ml/models/xgboost/hyperparameters/searchspaces.py
Sorry for the late reply; I was on a long vacation last week. Thanks!
Innixma
left a comment
Thanks for the changes, looks good! Last thing before benchmarking will be rebasing with mainline. We are still moving a few things around after modularization and I will let you know later this week when it is ready to rebase.
Mainline should be stable now, feel free to rebase. Things to be aware of for rebase:
(force-pushed from eab8d5e to 4dda5d1)
Job PR-691-11 is done.
@sackoh Benchmark results are good! Here are the results:
AutoGluon_4h_2020_11_02_xgb_XGBoostClassifier VS all (the `>`, `<`, `=`, and `% less avg. errors` columns compare each framework against AutoGluon_4h_2020_11_02_xgb_XGBoostClassifier):

| # | framework | > | < | = | % less avg. errors | time_train_s | metric_error | time_infer_s | loss_rescaled | rank | rank=1_count | rank=2_count | rank=3_count | rank>3_count | error_count |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | AutoGluon_4h_2020_11_02_xgb_CatboostClassifier | 23 | 14 | 0 | 6.098552 | 258.030727 | 0.275211 | 0.041744 | 0.072145 | 3.470588 | 8 | 10 | 3 | 16 | 0 |
| 1 | AutoGluon_4h_2020_11_02_xgb_XGBoostClassifier | 0 | 0 | 37 | 0.000000 | 608.679669 | 0.280103 | 1.821426 | 0.097839 | 3.926471 | 2 | 7 | 9 | 19 | 0 |
| 2 | AutoGluon_4h_2020_11_02_xgb_LightGBMClassifierXT | 20 | 16 | 1 | -1.879454 | 14.797889 | 0.308314 | 0.123659 | 0.119923 | 4.279412 | 3 | 5 | 8 | 21 | 0 |
| 3 | AutoGluon_4h_2020_11_02_xgb_LightGBMClassifier... | 13 | 21 | 0 | -6.096218 | 56.137859 | 0.304770 | 0.341352 | 0.127638 | 4.485294 | 7 | 2 | 2 | 23 | 3 |
| 4 | AutoGluon_4h_2020_11_02_xgb_LightGBMClassifier | 6 | 31 | 0 | -9.069855 | 16.127528 | 0.334640 | 0.082547 | 0.184649 | 5.485294 | 1 | 3 | 3 | 30 | 0 |
| 5 | AutoGluon_4h_2020_11_02_xgb_NeuralNetClassifier | 12 | 25 | 0 | -10.257584 | 87.022387 | 0.301530 | 0.737617 | 0.183621 | 6.441176 | 8 | 0 | 3 | 26 | 0 |
| 6 | AutoGluon_4h_2020_11_02_xgb_RandomForestClassi... | 9 | 27 | 0 | -15.483156 | 27.738800 | 0.353887 | 0.605831 | 0.199359 | 6.500000 | 1 | 3 | 1 | 31 | 1 |
| 7 | AutoGluon_4h_2020_11_02_xgb_RandomForestClassi... | 9 | 27 | 0 | -16.195367 | 17.687104 | 0.351978 | 1.356308 | 0.196360 | 6.661765 | 2 | 1 | 2 | 31 | 1 |
| 8 | AutoGluon_4h_2020_11_02_xgb_ExtraTreesClassifi... | 7 | 29 | 0 | -17.629729 | 6.924649 | 0.396179 | 1.014320 | 0.229516 | 7.117647 | 2 | 3 | 2 | 29 | 1 |
| 9 | AutoGluon_4h_2020_11_02_xgb_ExtraTreesClassifi... | 7 | 29 | 0 | -18.391755 | 6.582585 | 0.394828 | 1.222700 | 0.231477 | 7.367647 | 2 | 2 | 1 | 31 | 1 |
| 10 | AutoGluon_4h_2020_11_02_xgb_KNeighborsClassifi... | 2 | 35 | 0 | -56.051629 | 1.742805 | 0.887229 | 7.447910 | 0.903906 | 10.823529 | 1 | 0 | 1 | 35 | 0 |
| 11 | AutoGluon_4h_2020_11_02_xgb_KNeighborsClassifi... | 2 | 35 | 0 | -56.862477 | 1.741305 | 0.891297 | 7.526989 | 0.931777 | 11.441176 | 0 | 1 | 0 | 36 | 0 |
XGBoost is the 2nd best single model, beaten only by CatBoost on average. XGBoost had 0 failures which is great.
A few things to take note of:

- XGBoost takes a very long time to train, averaging 2.5x longer than CatBoost and 40x longer than LightGBM. This could in part be due to the lower learning rate used (CatBoost and LightGBM are using 0.1, but they might do better with 0.03 as you did). CatBoost was previously the slowest model we had, so XGBoost is very slow compared to most of the models. A quick way to test this is sketched right after this list.
- XGBoost takes a long time to infer. It is 20x slower than LightGBM and 50x slower than CatBoost. This is a concern for users who want fast inference speed, so it would be good for us to look into ways to speed this up in future.
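A quick way to test both hypotheses would be to override the XGBoost hyperparameters when fitting. Below is a minimal sketch, assuming the `XGB` hyperparameters key used elsewhere in this PR; the specific values are illustrative, not a recommendation:

```python
# Sketch only: raise the learning rate to match CatBoost/LightGBM and try a faster
# tree construction method; 'hist' and 'gpu_hist' are standard XGBoost tree_method values.
hyperparameters = {
    'XGB': {
        'n_estimators': 10000,
        'learning_rate': 0.1,    # the value CatBoost/LightGBM used in this benchmark
        'tree_method': 'hist',   # or 'gpu_hist' to test GPU acceleration
    },
}
```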
Because of these concerns, I recommend not adding XGBoost to the default configs yet, until we deep dive into potential solutions for training/inference speed (GPU acceleration perhaps?), and to avoid adding a new hard dependency to the requirements (the plan is to add it as an optional dependency via `pip install autogluon.tabular[xgboost]`, as part of a future `pip install autogluon.tabular[full]` alongside FastAI, Torch, and other models).
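For reference, the optional-dependency route usually means guarding the import with an actionable error message. A minimal sketch follows; the helper name mirrors the `try import xgboost` commit in this PR, but the exact function and wording here are illustrative:

```python
def try_import_xgboost():
    # Sketch: fail fast with an install hint if the optional dependency is missing.
    try:
        import xgboost  # noqa: F401
    except ImportError:
        raise ImportError(
            "Unable to import dependency xgboost. "
            "Try installing it via `pip install xgboost`."
        )
```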
I've added other comments to improve the current code. As an example, early stopping is still using the final iteration instead of the best iteration; once fixed, this will likely improve XGBoost's performance by a good margin. Once these comments are addressed, I'll be happy to approve and merge the PR!
Thanks again for all the work that went into this!
tabular/src/autogluon/tabular/task/tabular_prediction/hyperparameter_configs.py
tabular/src/autogluon/tabular/task/tabular_prediction/tabular_prediction.py
```python
if feature == original_feature:
    importance_dict[feature] += value

return importance_dict
```
Can you add (requires rebase first):
```python
from ...features.feature_metadata import R_OBJECT

def _get_default_auxiliary_params(self) -> dict:
    default_auxiliary_params = super()._get_default_auxiliary_params()
    extra_auxiliary_params = dict(
        ignored_type_group_raw=[R_OBJECT],
    )
    default_auxiliary_params.update(extra_auxiliary_params)
    return default_auxiliary_params
```
This way the model will no longer crash if given object dtype input, and will just drop it instead. This is in preparation for multi-modal tabular+text support (#756)
Great! I really wanted the functions to ignore object dtype.
```python
bst = self.model.get_booster()
self.params_trained['n_estimators'] = bst.best_iteration + 1
self.params_trained['best_ntree_limit'] = bst.best_ntree_limit
```
Currently, the model is using the final trained iteration during prediction instead of the best iteration (found during early stopping):
```
[757]	validation_0-logloss:0.26971
Stopping. Best iteration:
[721]	validation_0-logloss:0.26962
Saving AutogluonModels/ag-20201103_200907/models/XGBoostClassifier/model.pkl
	-0.2697 = Validation log_loss score
	29.67s = Training runtime
	0.16s = Validation runtime
```
I believe the predict call has to be updated to `self.model.predict(data, ntree_limit=bst.best_ntree_limit)`. Therefore, at the end of `_fit`, we can set a variable `self._best_ntree_limit = bst.best_ntree_limit` and then call `self.model.predict(data, ntree_limit=self._best_ntree_limit)`.
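For illustration, the end of `_fit` could then look roughly like this; a sketch, where only the `self._best_ntree_limit` line is the suggested addition and the rest mirrors the existing code in this PR:

```python
# Tail of _fit (sketch): remember the best iteration found by early stopping
# so that prediction can pass it as ntree_limit.
bst = self.model.get_booster()
self._best_ntree_limit = bst.best_ntree_limit
self.params_trained['n_estimators'] = bst.best_ntree_limit  # == bst.best_iteration + 1
```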
This is the new code that should be added:
```python
def _predict_proba(self, X, **kwargs):
    X = self.preprocess(X, **kwargs)

    if self.problem_type == REGRESSION:
        return self.model.predict(X, ntree_limit=self._best_ntree_limit)

    y_pred_proba = self.model.predict_proba(X, ntree_limit=self._best_ntree_limit)
    if self.problem_type == BINARY:
        if len(y_pred_proba.shape) == 1:
            return y_pred_proba
        elif y_pred_proba.shape[1] > 1:
            return y_pred_proba[:, 1]
        else:
            return y_pred_proba
    elif y_pred_proba.shape[1] > 2:
        return y_pred_proba
    else:
        return y_pred_proba[:, 1]
```
Once this is added, the correct iteration is used:
```
[757]	validation_0-logloss:0.26971
Stopping. Best iteration:
[721]	validation_0-logloss:0.26962
Saving AutogluonModels/ag-20201103_202139/models/XGBoostClassifier/model.pkl
	-0.2696 = Validation log_loss score
	29.17s = Training runtime
	0.12s = Validation runtime
```
Yes, that's a good point. The model is using the last trained iteration at the time it was early stopped, not the best iteration. I was trying not to override `_predict_proba()` while considering whether to use `best_ntree_limit` for prediction. I had a plan, but forgot to do this. 😢 That's the reason I wrote the following code:

```python
self.params_trained['best_ntree_limit'] = bst.best_ntree_limit
```

I appreciate your thoughtful review. I will add the code.
```python
bst = self.model.get_booster()
self.params_trained['n_estimators'] = bst.best_iteration + 1
```
Can actually replace with simply `self.params_trained['n_estimators'] = bst.best_ntree_limit`, as `bst.best_ntree_limit == bst.best_iteration + 1`.
Yes, they are the same. I will replace it.
```python
i = env.iteration
if i % period == 0 or i + 1 == env.begin_iteration or i + 1 == env.end_iteration:
    msg = '\t'.join([_fmt_metric(x, show_stdv) for x in env.evaluation_result_list])
    logger.log(20, '[%d]\t%s\n' % (i, msg))
```
No need for `\n` in the log messages, as `logger.log` already adds a `\n` to every message. This applies to every log message in `early_stop_custom` as well.
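For example, the log call in the snippet above would simply drop the trailing newline:

```python
logger.log(20, '[%d]\t%s' % (i, msg))  # logger.log already terminates the message with a newline
```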
I will remove all `\n` at the end of every log message.
```diff
 hyperparams = {'NN': {'num_epochs': 10, 'activation': 'relu', 'dropout_prob': ag.Real(0.0,0.5)},
-               'GBM': {'num_boost_round': 1000, 'learning_rate': ag.Real(0.01,0.1,log=True)} }
+               'GBM': {'num_boost_round': 1000, 'learning_rate': ag.Real(0.01,0.1,log=True)},
+               'XGB': {'n_estimators': 1000, 'learning_rate': ag.Real(0.01,0.1,log=True)} }
```
Not sure why we would specifically add XGBoost to this example. The example is just supposed to illustrate how users can exert more control over `fit()`, not to highlight what models are available.
Instead I think we should add a dedicated unit test that evaluates just the XGBoost model alone. @Innixma what do you think?
Ideally we want a unit test for all of our models, so I think that's something we can do after the PR to avoid delays in merging. Regarding `example_advanced_tabular.py`, I think it can be reverted to be unchanged, and a dedicated XGB test can be added in future (along with all the other models).
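A dedicated test along those lines might look like the following sketch, assuming the task-style API used in this PR's examples; the dataset path, label column, and model-name check are placeholders:

```python
from autogluon.tabular import TabularPrediction as task

def test_xgboost():
    # Placeholder dataset and label; restrict fitting to the new XGBoost model only.
    train_data = task.Dataset(file_path='train.csv')
    predictor = task.fit(
        train_data=train_data,
        label='class',
        hyperparameters={'XGB': {}},
    )
    leaderboard = predictor.leaderboard()
    assert leaderboard['model'].str.contains('XGBoost').any()
```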
@Innixma Thank you for your comments. One possible issue is how `n_jobs` is set:

```python
# possible option 1
import os
params = {'n_jobs': os.cpu_count()}  # or using multiprocessing

# possible option 2
params = {'n_jobs': 9999}
```

Another possible issue is the default parameters, as you also mentioned. After resolving the above 2 issues, training was faster than before. If you don't mind, it could be good to check whether performance improves.
@sackoh I'm open to both of those changes, thanks for the deep dive! If you could add those changes along with addressing the comments, I can do another benchmark run to see how it compares.

Note that `os.cpu_count()` worsens inference speed when the number of CPUs on the training machine is lower than on the inference machine.
@Innixma After you review, I will remove XGB from the default configs. Thank you for your reviews in advance!
Job PR-691-12 is done.
@sackoh Awesome work with the optimizations, here are the results:

With the optimizations, XGBoost has no noticeable drop in predictive accuracy, while being ~3x faster to train and ~5x faster to infer, bringing it in line with the other GBM models. With these improvements, I think we can keep XGBoost in the default config and as a default dependency (we may move it to optional in a future PR).

As a final preparation for merging, please resolve the minor conflict in `tabular/src/autogluon/tabular/trainer/model_presets/presets.py`.

Finally, if you are interested in contributing to AutoGluon in the future, here is an invite link to our developer Slack channel. Feel free to message me on Slack if you want to learn more about what we are working on. Thanks again for the high quality contribution!
…ular_xgboost

Conflicts:
	tabular/src/autogluon/tabular/trainer/model_presets/presets.py
Job PR-691-13 is done.
Innixma
left a comment
Looks great, thanks for the contribution!
@Innixma I'm pleased to be invited to the Slack channel. I'll contact you soon through the channel.
* Add xgboost model and utils to fit
* Add custom callback functions for early stopping
* Add basic params and hyperparameter spaces to tune xgboost model
* Update tabular prediction to include xgboost model for training
* Updated xgboost model fit to exclude invalid params `num_threads`, `num_gpus`
* Added XGBoost model to advanced tabular examples for test
* Modified env.iteration to best_iteration
* Removed overwritten parameter n_jobs
* Changed rabit to logger and Added print_evaluation to log iteration every 50 steps
* Modified to log every 50 or 1 iterations with callbacks
* Updated learning_rate in searchsapces to set equal with default hyperparameters
* Updated thread parameters to use all cores as default
* Updated xgboost model to use 'OneHotMergeRaresHandleUnknownEncoder'
* Updated small changes to import and setup
* Updated setup.py
* Updated way to get max_category_levels parameter
* Updated setup.py and Fixed typos after rebase
* Updated preprocess to use refit_full
* Updated try import xgboost
* Deleted `\n` from every log message
* Updated `n_jobs` to use whole parallel threads. Note that `os.cpu_count()` worse the inference speed when the number of CPUs in the training system is lower than in the inferencing system.
* Added `_get_default_auxiliary_params` and `_predict_proba`
* Added `test_xgboost` in a unittest
Issue #, if available:
Tabular: Add XGBoost Model #589
Description of changes:
This code adds an XGBoost model to the tabular predictor. I have tried to reference the existing LightGBM and CatBoost models for ease of maintenance.
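A minimal usage sketch of the new model, assuming the task-style API and the `XGB` hyperparameters key introduced in this PR; the file paths and label column are placeholders:

```python
from autogluon.tabular import TabularPrediction as task

# Placeholders: swap in real train/test files and the real label column.
train_data = task.Dataset(file_path='train.csv')
test_data = task.Dataset(file_path='test.csv')

predictor = task.fit(
    train_data=train_data,
    label='class',
    hyperparameters={'XGB': {'n_estimators': 1000, 'learning_rate': 0.03}},
)
y_pred = predictor.predict(test_data)
```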
Features
- `xgboost.train` functions like callbacks, custom eval functions, continuous training
- `scipy.sparse.csr_matrix` as training datasets

Tested
TODO
Future work
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.