Shortcuts

VisualGroundingInferencer

class mmpretrain.apis.VisualGroundingInferencer(model, pretrained=True, device=None, device_map=None, offload_folder=None, **kwargs)[源代码]

The inferencer for visual grounding.

参数:
  • model (BaseModel | str | Config) – A model name or a path to the config file, or a BaseModel object. The model name can be found by VisualGroundingInferencer.list_models() and you can also query it in 模型库统计.

  • pretrained (str, optional) – Path to the checkpoint. If None, it will try to find a pre-defined weight from the model you specified (only work if the model is a model name). Defaults to None.

  • device (str, optional) – Device to run inference. If None, the available device will be automatically used. Defaults to None.

  • **kwargs – Other keyword arguments to initialize the model (only work if the model is a model name).

示例

>>> from mmpretrain import VisualGroundingInferencer
>>> inferencer = VisualGroundingInferencer('ofa-base_3rdparty_refcoco')
>>> inferencer('demo/cat-dog.png', 'dog')[0]
{'pred_bboxes': tensor([[ 36.6000,  29.6000, 355.8000, 395.2000]])}
__call__(images, texts, return_datasamples=False, batch_size=1, **kwargs)[源代码]

Call the inferencer.

参数:
  • images (str | array | list) – The image path or array, or a list of images.

  • texts (str | list) – The text to do visual grounding.

  • return_datasamples (bool) – Whether to return results as DataSample. Defaults to False.

  • batch_size (int) – Batch size. Defaults to 1.

  • resize (int, optional) – Resize the short edge of the image to the specified length before visualization. Defaults to None.

  • draw_score (bool) – Whether to draw the prediction scores of prediction categories. Defaults to True.

  • show (bool) – Whether to display the visualization result in a window. Defaults to False.

  • wait_time (float) – The display time (s). Defaults to 0, which means “forever”.

  • show_dir (str, optional) – If not None, save the visualization results in the specified directory. Defaults to None.

  • line_width (int) – The line width of the bbox. Defaults to 3.

  • bbox_color (str | tuple) – The color of the bbox. Defaults to ‘green’.

返回:

The inference results.

返回类型:

list

static list_models(pattern=None)[源代码]

List all available model names.

参数:

pattern (str | None) – A wildcard pattern to match model names.

返回:

a list of model names.

返回类型:

List[str]

Read the Docs v: latest
Versions
latest
stable
mmcls-1.x
mmcls-0.x
dev
Downloads
epub
On Read the Docs
Project Home
Builds

Free document hosting provided by Read the Docs.