-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Open
Open
Copy link
Labels
enhancementNew feature or requestNew feature or request
Description
Is your feature request related to a problem or challenge?
Some Spark functions have equivalent DataFusion functions, differing slightly in behaviour. Work towards reducing duplicated code, calling out any differences between the functions when we can't merge them together.
Functions with Spark on left, DF (closest) equivalent on right:
- length -> character_length
- like -> like refactor: refactor spark like function to use datafusion like #19324
- Not much to deduplicate
- ilike -> ilike
- char -> chr
- ascii -> ascii
- Done by minor: refactor Spark ascii function to reuse DataFusion ascii function code #17965, though we could explore further deduplication
- rint -> round (?)
- mod/pmod -> modulo (?)
- hex -> to_hex (?)
- factorial -> factorial
-
expm1 -> exp - map_from_arrays -> map (?)
- sha1/sha2 -> sha* - @Jefffrey planning to look into this
- avg -> avg
- array -> make_array chore: merge make_array and spark array #19006
Not all of the above are exact equivalents, they're just listed to make it easier to check if we can reduce duplication to some degree. I've listed the Spark functions I noticed that seem to have something similar in DataFusion already.
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request