10 questions
3
votes
2
answers
15k
views
Why RandomForestClassifier on CPU (using SKLearn) and on GPU (using RAPIDs) get differents scores, very different?
I am using RandomForestClassifier on CPU with SKLearn and on GPU using RAPIDs. I am doing a benchmark between these two libraries about speed up and scoring using Iris dataset (it is a try, in the ...
9
votes
4
answers
23k
views
Rapids / docker: could not select device driver "" with capabilities: [[gpu]]
I'm new to Rapids, and rarely have had a good experience with conda. So I'm trying to work with a containerized version. I'm new to Docker, and the combination of unknowns leaves me unable to sort ...
4
votes
2
answers
9k
views
Installing cuDF & cuML into Colab with Rapids.ai version 0.11+
I'm trying to install Rapids library with cuDF and cuML to Colab session,
and executing code accroding to this example:
from
Install RAPIDS library on Googe Colab notebook
!wget -nc https://raw....
2
votes
4
answers
3k
views
Install RAPIDS library on Google Colab notebook
I was wondering if I could install RAPIDS library (executing machine learning tasks entirely on GPU) in Google Colaboratory notebook?
I've done some research but I've not been able to find the way to ...
1
vote
0
answers
330
views
dask_cudf/dask read_parquet failed with NotImplementedError: large_string
I am a new user of dask/dask_cudf.
I have a parquet files of various sizes (11GB, 2.5GB, 1.1GB), all of which failed with NotImplementedError: large_string. My dask.dataframe backend is cudf. When the ...
1
vote
0
answers
223
views
dask_cudf dataframe convert column of datetime string to column of datetime object
I am a new user of Dask and RapidsAI.
An exerpt of my data (in csv format):
Symbol,Date,Open,High,Low,Close,Volume
AADR,17-Oct-2017 09:00,57.47,58.3844,57.3645,58.3844,2094
AADR,17-Oct-2017 10:00,57....
0
votes
1
answer
553
views
Rapids on colab
I have always used following commands to install Rapids on Colab (from https://colab.research.google.com/drive/1rY7Ln6rEE1pOlfSHCYOVaqt8OvDO35J0#forceEdit=true&offline=true&sandboxMode=true)
!...
0
votes
1
answer
811
views
User defined function to combine CUDF dataframe columns
As per the title, I am trying to combine the row values from different cudf.DataFrame columns. The following code works for a standard pandas.DataFrame:
import pandas as pd
data = {'a': [1], 'b': [2], ...
0
votes
0
answers
1k
views
How do you do a grid search with cuml without a datatype error?
I tried doing a grid search with cuml. (rapids 21.10) I get a cupy conversion error. This doesn't happen if I build the model with the same dataset without a grid search. It also works doing it with ...
-1
votes
1
answer
441
views
DASK CUDA on multi node EMR cluster is unable to detect nodes
I have setup an AWS EMR cluster using 10 core nodes of type g4dn.xlarge (each machine/node conatins 1 GPU). When I run the following commands on Zeppelin Notebook, I see only 1 worker allotted in my ...