Conversation

@CodingWithTim CodingWithTim commented Nov 8, 2024

Why are these changes needed?

Related issue number (if applicable)

Checks

  • I've run format.sh to lint the changes in this PR.
  • I've included any doc changes needed.
  • I've made sure the relevant tests are passing (if applicable).

@CodingWithTim
Collaborator Author

> python clean_chat_data.py --action-type upvote

Will grab all the upvotes.

@infwinston (Member) left a comment


Thanks @CodingWithTim, left some comments.

@CodingWithTim
Collaborator Author

@infwinston Suggestions integrated.

@infwinston (Member) left a comment


A critical thing should be fixed: `chunk_size` shouldn't be 1, otherwise we will introduce lots of overhead and it might even be slower than the sequential implementation.
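For illustration, a minimal sketch of why the chunk size matters with `multiprocessing.Pool`. The function names and the chunk-size heuristic here are hypothetical, not taken from the PR: with `chunksize=1`, every task costs one pickling/dispatch round trip, so grouping tasks into larger chunks amortizes that overhead.

```python
from multiprocessing import Pool


def process_file(path):
    # Hypothetical per-file work; stands in for the PR's per-file cleaning.
    return {"ct_invalid": 0, "result": path}


def pick_chunksize(n_tasks, n_procs, factor=4):
    # Heuristic similar in spirit to Pool.map's default: split the work
    # into roughly factor * n_procs chunks so each worker receives a few
    # large batches instead of thousands of single-item messages.
    chunksize, extra = divmod(n_tasks, n_procs * factor)
    return chunksize + 1 if extra else max(chunksize, 1)


if __name__ == "__main__":
    files = [f"file_{i}" for i in range(100)]
    n_procs = 2
    with Pool(processes=n_procs) as pool:
        # chunksize=1 would send one task per IPC round trip; the
        # computed chunk size amortizes dispatch overhead across tasks.
        results = list(
            pool.imap(process_file, files,
                      chunksize=pick_chunksize(len(files), n_procs))
        )
    print(len(results))
```

The trade-off: larger chunks reduce IPC overhead but make load balancing coarser, so a multiple of the worker count (rather than one giant chunk) is the usual middle ground.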

Comment on lines 158 to 167

```python
# Aggregate results from child processes
ct_invalid_conv_id = sum(
    [data["ct_invalid_conv_id"] for data in results if "ct_invalid_conv_id" in data]
)
ct_invalid = sum([data["ct_invalid"] for data in results if "ct_invalid" in data])
ct_network_error = sum(
    [data["ct_network_error"] for data in results if "ct_network_error" in data]
)
all_models = set([data["model"] for data in results if "model" in data])
chats = [data["result"] for data in results if "result" in data]
```

Can we merge into one for loop?
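A sketch of what the merged single-pass version could look like. It is standalone, with a made-up `results` list for demonstration; only the aggregation loop itself reflects the suggestion:

```python
# Hypothetical worker output; each dict may carry only some of the keys.
results = [
    {"ct_invalid": 1, "model": "vicuna"},
    {"ct_network_error": 2, "result": {"id": "abc"}, "model": "gpt-4"},
    {"ct_invalid_conv_id": 3},
]

# Single pass over results, replacing the five separate comprehensions.
ct_invalid_conv_id = ct_invalid = ct_network_error = 0
all_models = set()
chats = []
for data in results:
    ct_invalid_conv_id += data.get("ct_invalid_conv_id", 0)
    ct_invalid += data.get("ct_invalid", 0)
    ct_network_error += data.get("ct_network_error", 0)
    if "model" in data:
        all_models.add(data["model"])
    if "result" in data:
        chats.append(data["result"])
```

One traversal of `results` instead of five, with `dict.get(key, 0)` standing in for the `if key in data` membership checks on the counters.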

