Advice
0 votes
1 reply
83 views

How should I handle a mass-point in the dependent variable when running OLS regression in R? I’m working with a household expenditure dataset (Living Costs 2019) where the dependent variable is the ...
Jimothan's user avatar
1 vote
1 answer
80 views

I'm currently in a Coding class and I'm making an app for a project. We have to insert Datasets into our app. The problem I am experiencing is that when I use my dropdown menu for options in my ...
Melina Nichols's user avatar
1 vote
1 answer
146 views

The idea is simple and no matter how much I have searched I couldn't find any similar example. Using the OleDbDataAdapter class, I am able to achieve the first step: Read a specific range in a ...
ptuga's user avatar
  • 79
1 vote
0 answers
108 views

I am trying to build a PyTorch Dataset based on some .jsonl files. The size of each .jsonl file is about 2 GB, and I have 50 such files. Therefore, it would not be very practical to load all these ...
xuehao-049's user avatar
-1 votes
0 answers
24 views

There is a set of elements that look like <a data-v-924a11e2 ... </a>. The '924a11e2' part of 'data-v-924a11e2' is random each time the page updates, and this dataset 'data-v-924a11e2' has no value. ...
Alex Krutoi's user avatar
0 votes
0 answers
103 views

The application I'm writing has a main form where all the database fields and DBGrid that contains the entire SQLite dataset (ExpItemQuery) reside. I have designed a database search functionality into ...
Skypilot65's user avatar
0 votes
0 answers
21 views

I have coded up a program that has a scoring target value plus other necessary values associated with that target value, plus the same features are used as dependents in my prediction engine. Up to ...
ciaran haines's user avatar
0 votes
1 answer
139 views

I am trying to load a training dataset in my Google Colab notebook but keep getting an error. Here is the code snippet which returns the error: from datasets import load_dataset ds = load_dataset(&...
AlecArk's user avatar
1 vote
1 answer
48 views

The survival package contains a heart dataset which comes in two formats, "jasa" and "jasa1". How can I access the "jasa" format? I tried data("jasa", package=&...
robertspierre's user avatar
3 votes
1 answer
57 views

I’m currently working with a large dataset and need help merging multiple .fasta files. Although I’m not an expert, I’ve attempted to automate this process using a Python script. However, the merging ...
Andrea S.'s user avatar
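For the FASTA question above, a minimal sketch of one way to merge several files using only the Python standard library. The excerpt does not show the asker's script or the actual failure, so this is an illustration under assumptions, not a fix for their specific problem: `merge_fasta` and its parameters are hypothetical names, inputs are assumed to be plain-text FASTA, and blank lines are dropped so that a file lacking a trailing newline cannot glue its last sequence line onto the next header.

```python
def merge_fasta(inputs, output):
    """Concatenate FASTA files into `output`; return the number of records written.

    Hypothetical helper: `inputs` is an iterable of file paths, `output` a path.
    """
    records = 0
    with open(output, "w", encoding="utf-8", newline="\n") as out:
        for path in inputs:
            with open(path, encoding="utf-8") as src:
                for line in src:
                    line = line.rstrip("\r\n")
                    if not line:
                        continue  # skip blank lines between or inside files
                    if line.startswith(">"):
                        records += 1  # every header line starts a new record
                    # Always terminate the line, even if the source file did not
                    out.write(line + "\n")
    return records
```

Reading line by line keeps memory use flat regardless of file size, which matters for large datasets. The sketch does not deduplicate sequence IDs or validate sequence characters.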
0 votes
1 answer
54 views

I want to train an LLM on TPUv4-32 using JAX/Flax. The dataset is stored in a mounted Google storage bucket. The dataset (Red-Pajama-v2) consists of 5000 shards, which are stored in .json.gz files: ~/...
innerproduct's user avatar
3 votes
2 answers
106 views

I am currently working on a medical dataset containing whole slide images (~300 images). These images are pretty big (.tif, average of 7k by 7k pixels). I am currently dividing each of these images ...
Shini_ Gami's user avatar
0 votes
0 answers
70 views

I am trying to apply the transformation below to prepare my datasets for fine-tuning using unsloth huggingface. It requires the dataset to be in the following format. def convert_to_conversation(sample): ...
SoraHeart's user avatar
  • 428
0 votes
1 answer
38 views

Why does TensorDataset divide the data into minibatches? For example, when a 2D array is put into it, instead of yielding 2D tensors as batches, it sets the required batches to be minibatches, and its actual &...
J. Doe's user avatar
  • 305
1 vote
1 answer
54 views

I am trying to help my church have a lending library. Currently, we have a Google sheet inserted in the Wix site. I am trying to make a better version. The idea is to import all the books as a .csv ...
Barney Joseph's user avatar
1 vote
1 answer
100 views

I hope you are doing well! I have developed a map/reduce script to send dataset results as a CSV file. If there is a huge amount of data, the script exceeds the usage limit in the reduce stage. I ...
Maira S's user avatar
  • 121
0 votes
1 answer
40 views

I have a custom Subset: class TestSubset2(Subset): def __init__(self, dataset, indices, days=False): super().__init__(dataset, indices) self.days = days def __getitem__(self, ...
Samuel Kostousov's user avatar
0 votes
2 answers
142 views

I am trying to use a lookup activity, but I want to use a dataset which is parameterized. I thought it would be simple, but I think it doesn't support dataset parameters for an Azure database source. Can ...
SandeepG's user avatar
0 votes
1 answer
31 views

How can I return the expected dataset (stored procedure) results from SSRS report Manager? SSRS report Manager is returning a different dataset (stored procedure) result than what's expected from ...
Andrew Phommathep's user avatar
0 votes
1 answer
50 views

I am trying to use a dataset parameter set in the pipeline to make my blob path dynamic for each data flow I've created. However, just testing this first data flow, I keep getting an error saying '...
creed6700's user avatar
0 votes
1 answer
69 views

I'm encountering a discrepancy between the Created date displayed in a NetSuite Saved Search and a NetSuite Dataset. The saved search shows one date, while the dataset shows the same record with the ...
Maira S's user avatar
  • 121
1 vote
1 answer
62 views

While trying to retrieve the SPICE storage copied per dataset in AWS QuickSight, I came across the following: AWS does not provide a direct per-dataset breakdown in the console, but you can use ...
Tam's user avatar
  • 4,027
0 votes
2 answers
50 views

I have the following data below: +-------+----------+------------+---------+---------------------+-----------+ |id |resource id|resource name|event-desc|event-date |ipaddress1 | +-------+---...
Santosh Shimpi's user avatar
1 vote
1 answer
75 views

I am testing for outliers using the iris dataset mod <- lm(Sepal.Width ~ Sepal.Length*Species, data = iris) I use rstudent() to calculate the studentized residuals, and add an indicator whether ...
mariann's user avatar
  • 35
1 vote
1 answer
70 views

I'm new to R and, having next to no experience with it, I'm struggling with what may be a pretty easy problem. I have a dataset of acceptability judgments provided by a group of 30 participants on ...
Daler Fergani's user avatar
0 votes
1 answer
89 views

I'm working on a CNN project where I make use of TensorFlow and Keras. In addition, I use interleave as the dataset is rather large (too large to load into the RAM with our resources). But there's an ...
William Frid's user avatar
0 votes
1 answer
99 views

I have a function to measure the allocated ram by python in megabytes: def getram(): print(psutil.Process(os.getpid()).memory_info().rss / 1024**2) And also I have: device = "cuda" My ...
Nex's user avatar
  • 3
0 votes
1 answer
46 views

I use this code to set up my dataset for training and prediction: train_dataset = train_dataset.batch(train_batch_sz) train_dataset = train_dataset.repeat().prefetch(5) test_dataset_sim = test_dataset ...
Jonathan Roy's user avatar
0 votes
1 answer
116 views

I'm trying to load some data using datasets.load_datasets. It runs correctly on a head node. The issue is happening on a slurm node. I'm using a conda env with datasets installed. When I run on head ...
ate50eggs's user avatar
  • 454
0 votes
1 answer
89 views

I would like to work with a dataset in a Jupyter notebook in the Carnets app on iPhone, but I can’t find the correct path. Can someone help me? I downloaded the dataset from Kaggle. Are free files ...
Samusdr17's user avatar
0 votes
0 answers
14 views

We are just beginning to use Oracle Analytics. We manage large amounts of data, and much of what my team does is run queries for lists of items. This can be 10, 100, or 1000s of different item ...
babs75's user avatar
  • 1
0 votes
1 answer
73 views

I am using a NetSuite dataset (N/dataset API in SuiteScript 2.1) to retrieve existing dataset results. The dataset returns results in JSON format, including columns like PO Status, Billing Status, Sub ...
Maira S's user avatar
  • 121
0 votes
1 answer
263 views

I'm trying to compute the recall after performing a HNSW search in FAISS. By recall, I mean the following metric: Recall = TP / (TP + FN), where I consider an image as a True Positive (TP) if it ...
No Yeah's user avatar
  • 23
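The recall formula quoted in the question above, Recall = TP / (TP + FN), can be sketched in plain Python. The excerpt is cut off before it gives the asker's exact definition of a true positive, so this sketch assumes one: a retrieved ID counts as a TP if it appears in a ground-truth set of relevant IDs, and any relevant ID that was not retrieved counts as an FN. The function name `recall` and both parameters are hypothetical and not part of FAISS.

```python
def recall(retrieved_ids, relevant_ids):
    """Recall = TP / (TP + FN) for a single query.

    Assumption: TP = relevant IDs that were retrieved,
                FN = relevant IDs that were missed.
    """
    relevant = set(relevant_ids)
    if not relevant:
        raise ValueError("recall is undefined when there are no relevant items")
    tp = len(relevant & set(retrieved_ids))  # relevant and retrieved
    fn = len(relevant) - tp                  # relevant but not retrieved
    return tp / (tp + fn)


# Two of the four relevant items (2 and 4) were retrieved.
print(recall([1, 2, 3, 4], [2, 4, 6, 8]))  # 0.5
```

With an approximate index such as HNSW, the ground-truth set is often taken from an exact (brute-force) search over the same data. Whether that matches the asker's setup cannot be determined from the excerpt.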
1 vote
6 answers
113 views

I am currently trying to make a dataset which refers to a series of data in another tab, of which it takes 20 points (vertically) and averages them and gives a stdev then two open rows and then again ...
Jan van oest's user avatar
1 vote
0 answers
100 views

I am using the xarray library and I have some doubts/questions. I have this dataset: ds <xarray.Dataset> Size: 2GB Dimensions: (Latitude: 364, Longitude: 246, Lon_u: 247, Lat_v: 364, ...
ChesuCR's user avatar
  • 9,752
0 votes
0 answers
15 views

I have a simple PowerBI dashboard that links to several Excel spreadsheets. I take a cut of data monthly, but want to save and allow the user to select previous iterations using a drop down slicer. ...
FraserS's user avatar
0 votes
1 answer
65 views

I have the following code: myDataSet.myTable.DefaultView.RowFilter = "Code <> 'X'" After the line executes, I can see the updated result where no rows contain an X for the Code column,...
Rich's user avatar
  • 6,585
0 votes
1 answer
63 views

I'm a newbie in PyTorch and AI. I found a repo (https://github.com/kacperbednarz1997/AMC_nets) where the author compares various models for automatic modulation classification (AMC). I get all datasets used ...
AlexDtd's user avatar
  • 21
0 votes
1 answer
47 views

I want to inherit from the torch.utils.data.Dataset class to load my custom image dataset, let's say for a classification task. Here is the example from the official PyTorch website at this link: import os ...
Sepehr Amini Afshar's user avatar
0 votes
0 answers
28 views

In Loading a DataSet from XML, I've found the following statement: If you call ReadXml to load a very large file, you may encounter slow performance. To ensure best performance for ReadXml, on a ...
exaiwitmx's user avatar
  • 119
0 votes
0 answers
59 views

I'm working with the Yahoo! Webscope dataset ydata-frontpage-todaymodule-clicks-v1_0 (specifically, the click logs for the first ten days in May 2009). The dataset description states that each user ...
amarchin's user avatar
  • 2,124
0 votes
1 answer
76 views

I am currently working on a project where I have to first merge two datasets: The first dataset contains weather data in 30 minute intervals. The second dataset contains minute-level data with PV ...
TryingToLearn's user avatar
0 votes
0 answers
14 views

I'm comparing two datasets in SQL Server (tables of the same schema) using row hashing (for example, using CheckSum() or HashBytes()). At this point, I can tell which records are identical and which ...
waza's user avatar
  • 11
0 votes
1 answer
61 views

I have uploaded a csv I created. This dataset is visible in Kaggle under "My datasets" - I can open and inspect it. I have tried various ways of linking a dataset to my notebook. clicking ...
Heike Vau's user avatar
2 votes
0 answers
68 views

I am trying to set up a dataset in R to run a neural network using TensorFlow, but I can't seem to figure out the right code to allow sample weights to be specified. The input array is image_data and ...
D_Taylor's user avatar
0 votes
0 answers
13 views

Hi, I'm trying to implement a consistent color coding for organ-specific biodistribution data across multiple sheets in Tableau. My dataset is structured such that each organ is represented by its own ...
GalB's user avatar
  • 11
0 votes
2 answers
263 views

Can I add a QR image to a Crystal Report dynamically without adding a new column to the dataset and dragging it to the report? I retrieve the URL from the database, and based on that, I display the ...
ahmed salah abd elaziz elbarba's user avatar
0 votes
0 answers
109 views

Given: import os from torchvision.datasets import ImageNet from torch.utils.data import DataLoader train_dataset = ImageNet( root=os.path.expanduser("~/.cache"), split='train', ...
farid's user avatar
  • 1,631
0 votes
1 answer
53 views

We are trying to fine-tune a custom model on an imported DeiT distilled patch16 384 pretrained model. Output: Cost at epoch 0 is 4.611058227040551 Cost at epoch 1 is 0.9889081553979353 test set ...
shamilemir's user avatar
1 vote
1 answer
87 views

I need to merge datasets A and B based on the date, id and nearest time. The times in the datasets do not match. The time in B is always 0 to 10 minutes greater than the time in A. I tried left_join ...
Mee's user avatar
  • 321
