Add unit tests for pandas-inherited pd.DataFrame methods with ArkoudaExtensionArray columns #5197

@ajpotts

Write unit tests covering pd.DataFrame methods that are inherited/available via pandas when one or more columns are backed by ArkoudaExtensionArray.

The goal is to verify pandas semantics (column-wise operations, alignment, dtype and NA behavior, indexing and selection, reductions, and sort/groupby compatibility where applicable) and to catch regressions in inheritance and dispatch for Arkouda-backed DataFrames, including mixed DataFrames with both Arkouda-backed and NumPy-backed columns.

  • T
  • _AXIS_LEN
  • _AXIS_ORDERS
  • _AXIS_TO_AXIS_NUMBER
  • _HANDLED_TYPES
  • __abs__
  • __add__
  • __and__
  • __annotations__
  • __array__
  • __array_priority__
  • __array_ufunc__
  • __arrow_c_stream__
  • __bool__
  • __class__
  • __contains__
  • __copy__
  • __dataframe__
  • __dataframe_consortium_standard__
  • __deepcopy__
  • __delattr__
  • __delitem__
  • __dict__
  • __dir__
  • __divmod__
  • __doc__
  • __eq__
  • __finalize__
  • __firstlineno__
  • __floordiv__
  • __format__
  • __ge__
  • __getattr__
  • __getattribute__
  • __getitem__
  • __getstate__
  • __gt__
  • __hash__
  • __iadd__
  • __iand__
  • __ifloordiv__
  • __imod__
  • __imul__
  • __init__
  • __init_subclass__
  • __invert__
  • __ior__
  • __ipow__
  • __isub__
  • __iter__
  • __itruediv__
  • __ixor__
  • __le__
  • __len__
  • __lt__
  • __matmul__
  • __mod__
  • __module__
  • __mul__
  • __ne__
  • __neg__
  • __new__
  • __nonzero__
  • __or__
  • __pandas_priority__
  • __pos__
  • __pow__
  • __radd__
  • __rand__
  • __rdivmod__
  • __reduce__
  • __reduce_ex__
  • __repr__
  • __rfloordiv__
  • __rmatmul__
  • __rmod__
  • __rmul__
  • __ror__
  • __round__
  • __rpow__
  • __rsub__
  • __rtruediv__
  • __rxor__
  • __setattr__
  • __setitem__
  • __setstate__
  • __sizeof__
  • __static_attributes__
  • __str__
  • __sub__
  • __subclasshook__
  • __truediv__
  • __weakref__
  • __xor__
  • _accessors
  • _accum_func
  • _agg_examples_doc
  • _agg_see_also_doc
  • _align_for_op
  • _align_frame
  • _align_series
  • _append
  • _arith_method
  • _arith_method_with_reindex
  • _as_manager
  • _attrs
  • _box_col_values
  • _can_fast_transpose
  • _check_inplace_and_allows_duplicate_labels
  • _check_is_chained_assignment_possible
  • _check_label_or_level_ambiguity
  • _check_setitem_copy
  • _clear_item_cache
  • _clip_with_one_bound
  • _clip_with_scalar
  • _cmp_method
  • _combine_frame
  • _consolidate
  • _consolidate_inplace
  • _construct_axes_dict
  • _construct_result
  • _constructor
  • _constructor_from_mgr
  • _constructor_sliced
  • _constructor_sliced_from_mgr
  • _create_data_for_split_and_tight_to_dict
  • _data
  • _deprecate_downcast
  • _dir_additions
  • _dir_deletions
  • _dispatch_frame_op
  • _drop_axis
  • _drop_labels_or_levels
  • _ensure_valid_index
  • _find_valid_index
  • _flags
  • _flex_arith_method
  • _flex_cmp_method
  • _from_arrays
  • _from_mgr
  • _get_agg_axis
  • _get_axis
  • _get_axis_name
  • _get_axis_number
  • _get_axis_resolvers
  • _get_block_manager_axis
  • _get_bool_data
  • _get_cleaned_column_resolvers
  • _get_column_array
  • _get_index_resolvers
  • _get_item_cache
  • _get_label_or_level_values
  • _get_numeric_data
  • _get_value
  • _get_values_for_csv
  • _getitem_bool_array
  • _getitem_multilevel
  • _getitem_nocopy
  • _getitem_slice
  • _gotitem
  • _hidden_attrs
  • _indexed_same
  • _info_axis
  • _info_axis_name
  • _info_axis_number
  • _info_repr
  • _init_mgr
  • _inplace_method
  • _internal_names
  • _internal_names_set
  • _is_copy
  • _is_homogeneous_type
  • _is_label_or_level_reference
  • _is_label_reference
  • _is_level_reference
  • _is_mixed_type
  • _is_view
  • _is_view_after_cow_rules
  • _iset_item
  • _iset_item_mgr
  • _iset_not_inplace
  • _item_cache
  • _iter_column_arrays
  • _ixs
  • _logical_func
  • _logical_method
  • _maybe_align_series_as_frame
  • _maybe_cache_changed
  • _maybe_update_cacher
  • _metadata
  • _mgr
  • _min_count_stat_function
  • _needs_reindex_multi
  • _pad_or_backfill
  • _protect_consolidate
  • _reduce
  • _reduce_axis1
  • _reindex_axes
  • _reindex_multi
  • _reindex_with_indexers
  • _rename
  • _replace_columnwise
  • _repr_data_resource_
  • _repr_fits_horizontal_
  • _repr_fits_vertical_
  • _repr_html_
  • _repr_latex_
  • _reset_cache
  • _reset_cacher
  • _sanitize_column
  • _series
  • _set_axis
  • _set_axis_name
  • _set_axis_nocheck
  • _set_is_copy
  • _set_item
  • _set_item_frame_value
  • _set_item_mgr
  • _set_value
  • _setitem_array
  • _setitem_frame
  • _setitem_slice
  • _shift_with_freq
  • _should_reindex_frame_op
  • _slice
  • _stat_function
  • _stat_function_ddof
  • _take_with_is_copy
  • _to_dict_of_blocks
  • _to_latex_via_styler
  • _typ
  • _update_inplace
  • _validate_dtype
  • _values
  • _where
  • a
  • abs
  • add
  • add_prefix
  • add_suffix
  • agg
  • aggregate
  • ak
  • align
  • all
  • any
  • apply
  • applymap
  • asfreq
  • asof
  • assign
  • astype
  • at
  • at_time
  • attrs
  • axes
  • b
  • backfill
  • between_time
  • bfill
  • bool
  • boxplot
  • clip
  • columns
  • combine
  • combine_first
  • compare
  • convert_dtypes
  • copy
  • corr
  • corrwith
  • count
  • cov
  • cummax
  • cummin
  • cumprod
  • cumsum
  • describe
  • diff
  • div
  • divide
  • dot
  • drop
  • drop_duplicates
  • droplevel
  • dropna
  • dtypes
  • duplicated
  • empty
  • eq
  • equals
  • eval
  • ewm
  • expanding
  • explode
  • ffill
  • fillna
  • filter
  • first
  • first_valid_index
  • flags
  • floordiv
  • from_dict
  • from_records
  • ge
  • get
  • groupby
  • gt
  • head
  • hist
  • iat
  • idxmax
  • idxmin
  • iloc
  • index
  • infer_objects
  • info
  • insert
  • interpolate
  • isetitem
  • isin
  • isna
  • isnull
  • items
  • iterrows
  • itertuples
  • join
  • keys
  • kurt
  • kurtosis
  • last
  • last_valid_index
  • le
  • loc
  • lt
  • map
  • mask
  • max
  • mean
  • median
  • melt
  • memory_usage
  • merge
  • min
  • mod
  • mode
  • mul
  • multiply
  • ndim
  • ne
  • nlargest
  • notna
  • notnull
  • nsmallest
  • nunique
  • pad
  • pct_change
  • pipe
  • pivot
  • pivot_table
  • plot
  • pop
  • pow
  • prod
  • product
  • quantile
  • query
  • radd
  • rank
  • rdiv
  • reindex
  • reindex_like
  • rename
  • rename_axis
  • reorder_levels
  • replace
  • resample
  • reset_index
  • rfloordiv
  • rmod
  • rmul
  • rolling
  • round
  • rpow
  • rsub
  • rtruediv
  • sample
  • select_dtypes
  • sem
  • set_axis
  • set_flags
  • set_index
  • shape
  • shift
  • size
  • skew
  • sort_index
  • sort_values
  • squeeze
  • stack
  • std
  • style
  • sub
  • subtract
  • sum
  • swapaxes
  • swaplevel
  • tail
  • take
  • to_clipboard
  • to_csv
  • to_dict
  • to_excel
  • to_feather
  • to_gbq
  • to_hdf
  • to_html
  • to_json
  • to_latex
  • to_markdown
  • to_numpy
  • to_orc
  • to_parquet
  • to_period
  • to_pickle
  • to_records
  • to_sql
  • to_stata
  • to_string
  • to_timestamp
  • to_xarray
  • to_xml
  • transform
  • transpose
  • truediv
  • truncate
  • tz_convert
  • tz_localize
  • unstack
  • update
  • value_counts
  • values
  • var
  • where
  • xs
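
The list above mixes dunder methods, private helpers, and public API, so one possible first step is to bucket the names before deciding what to test. The sketch below is only an illustration: `bucket_names` and the `sample` list are hypothetical, and the idea of prioritizing the public bucket is an assumption, not something this issue specifies.

```python
def bucket_names(names):
    """Split attribute names into dunder, private, and public buckets.

    Hypothetical helper for planning test coverage; not part of
    pandas or Arkouda.
    """
    buckets = {"dunder": [], "private": [], "public": []}
    for name in names:
        if name.startswith("__") and name.endswith("__"):
            # Operator and protocol hooks such as __add__ or __getitem__.
            buckets["dunder"].append(name)
        elif name.startswith("_"):
            # pandas internals such as _mgr or _reduce.
            buckets["private"].append(name)
        else:
            # User-facing API such as groupby or sort_values.
            buckets["public"].append(name)
    return buckets


# A small sample taken from the list in this issue.
sample = ["T", "__add__", "_mgr", "groupby", "sort_values", "__getitem__", "_reduce"]
buckets = bucket_names(sample)

for kind, members in buckets.items():
    print(f"{kind}: {members}")
```

Under this split, the public names and the operator dunders are the ones users call directly, so they may be the most natural candidates for explicit tests, while private helpers would likely be exercised indirectly through them.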
