Two more questions need answering before these findings can become actionable:
How do these two groups compared to a third group that can use both? ChatGPT is pretty useless on its own when correctness is important, but it improves a lot when you combine it with ways to verify its output.
How much time and effort would this new group need to accomplish the same task? One of ChatGPT’s strengths is being able to communicate a piece of information in many different ways, and in whatever order you ask of it. It’s then much faster to verify or through a legitimate source than it is to learn from those sources in the first place.
Two more questions need answering before these findings can become actionable: