chore: update fun_asr_nano batch inference model logic by Hsuxu · Pull Request #2858 · modelscope/FunASR

Hsuxu · 2026-05-22T15:20:09Z

No description provided.

gemini-code-assist

Code Review

This pull request introduces batch inference support for the fun_asr_nano model by implementing a sequential processing loop within the inference method and removes the previous restriction on batch sizes greater than one. The changes also include extensive code reformatting for better readability. Feedback highlights several logic issues in the new batch implementation: the error handler's dummy result is missing required keys, metadata is lost for all but the last item in a batch, and there are potential TypeError risks if input keys are mismatched. Additionally, the implementation currently ignores user-provided data lengths and may lead to silent data loss if internal methods are called directly with multiple items.

gemini-code-assist · 2026-05-22T15:21:46Z

+                        f"batch item {i} inference failed: {str(e)}, {traceback.format_exc()}"
+                    )
+                    if single_key:
+                        all_results.append({"key": single_key[0], "text": ""})


The dummy result dictionary used in the exception handler is missing several keys (text_tn, label, ctc_text, timestamps, etc.) that are present in a standard result. Downstream code expecting these keys will encounter a KeyError. Ensure the error result structure is consistent with the successful one.

References

Ensure that invalid inputs or states are safely handled in all cases. ^(link)

gemini-code-assist · 2026-05-22T15:21:46Z

-        if kwargs.get("batch_size", 1) > 1:
-            raise NotImplementedError("batch decoding is not implemented")
-
        contents = self.data_template(data_in[0])


The removal of the NotImplementedError for batch_size > 1 is problematic because inference_prepare (and consequently inference_llm) still only processes the first element of data_in (data_in[0]). While the inference method now handles multiple inputs via a loop, calling inference_llm directly with a list of multiple items will result in only the first item being processed without any warning or error. It is recommended to keep a check on the length of data_in within inference_llm or inference_prepare to prevent silent data loss.

gemini-code-assist · 2026-05-22T15:21:46Z

+            last_meta = {}
+            for i, single_data in enumerate(data_in):
+                single_key = [key[i]] if i < len(key) else None
+                try:
+                    res, meta = self.inference_llm(
+                        [single_data],
+                        data_lengths=None,
+                        key=single_key,
+                        tokenizer=tokenizer,
+                        frontend=frontend,
+                        **kwargs,
+                    )
+                    all_results.extend(res)
+                    last_meta = meta
+                except Exception as e:
+                    logging.error(
+                        f"batch item {i} inference failed: {str(e)}, {traceback.format_exc()}"
+                    )
+                    if single_key:
+                        all_results.append({"key": single_key[0], "text": ""})
+            return all_results, last_meta


The last_meta variable only captures the metadata from the final iteration of the loop. This leads to the loss of metadata (such as encoder outputs, internal states, or specific timing information) for all other items in the batch. Consider returning a list of metadata dictionaries or merging them to maintain consistency with the all_results list, especially if this model is used in pipelines where per-sample metadata is required.

gemini-code-assist · 2026-05-22T15:21:46Z

+            all_results = []
+            last_meta = {}
+            for i, single_data in enumerate(data_in):
+                single_key = [key[i]] if i < len(key) else None


If a key list is provided by the user but is shorter than data_in, single_key will be set to None. This will cause a TypeError in inference_llm at line 687 (or line 764) when it attempts to subscript key. It is safer to validate the length of key at the beginning of the inference method or provide a fallback key in the loop.

References

Defensive programming: ensure that invalid inputs or states are safely handled. ^(link)

Ensure appropriate null/nil/None checks or other language-idiomatic guards exist before object property accesses.

gemini-code-assist · 2026-05-22T15:21:46Z

+                try:
+                    res, meta = self.inference_llm(
+                        [single_data],
+                        data_lengths=None,


The data_lengths argument is hardcoded to None in the batch loop, which means any lengths provided by the user to the inference method are ignored for batch items. This may affect model components that rely on explicit length information.

References

Verify code functionality and ensure alignment between function descriptions and implementations. ^(link)

chore: update fun_asr_nano batch inference model logic

ab0873b

gemini-code-assist Bot reviewed May 22, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: update fun_asr_nano batch inference model logic#2858

chore: update fun_asr_nano batch inference model logic#2858
Hsuxu wants to merge 1 commit into
modelscope:mainfrom
Hsuxu:feat/fun-asr-nano_batchinfer

Hsuxu commented May 22, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 22, 2026

Uh oh!

gemini-code-assist Bot May 22, 2026

Uh oh!

gemini-code-assist Bot May 22, 2026

Uh oh!

gemini-code-assist Bot May 22, 2026

Uh oh!

gemini-code-assist Bot May 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Hsuxu commented May 22, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 22, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 22, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 22, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 22, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 22, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant