Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: googleapis/python-bigquery-dataframes
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: v0.6.0
Choose a base ref
...
head repository: googleapis/python-bigquery-dataframes
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: v0.7.0
Choose a head ref
  • 12 commits
  • 37 files changed
  • 9 contributors

Commits on Oct 4, 2023

  1. feat: add aliases for several series properties (#80)

    * feat: add aliases for several series properties
    
    ---------
    
    Co-authored-by: Tim Swast <swast@google.com>
    TrevorBergeron and tswast authored Oct 4, 2023
    Copy the full SHA
    c0efec8 View commit details
  2. feat: Allow df.drop to take an index object (#68)

    * feat: Allow passing index objects to df.drop
    
    * remove notebook files
    
    * add first implementation for df.drop(index)
    
    * use index_columns property
    
    * don't use _expr.keys()
    
    * fix order bug and add test
    
    * fix index names
    
    * support multiindex
    
    * remove accidentally added files
    
    * add type hint
    
    * remove debug print statements
    
    * fix: fix df/series.iloc by list with multiindex
    
    * complete merge
    
    * remove unneeded isinstance
    
    * refactor _drop_by_index
    
    ---------
    
    Co-authored-by: Henry J Solberg <henryjsolberg@google.com>
    Co-authored-by: Tim Swast <swast@google.com>
    3 people authored Oct 4, 2023
    Copy the full SHA
    740c451 View commit details

Commits on Oct 5, 2023

  1. refactor: simplify ArrayValue public interface (#82)

    * refactor: simplify ArrayValue public interface
    
    ---------
    
    Co-authored-by: Tim Swast <swast@google.com>
    TrevorBergeron and tswast authored Oct 5, 2023
    Copy the full SHA
    ff3bb89 View commit details
  2. feat: add equals methods to series/dataframe (#76)

    * feat: add equals methods to series/dataframe
    TrevorBergeron authored Oct 5, 2023
    Copy the full SHA
    636a209 View commit details

Commits on Oct 7, 2023

  1. fix: change the invalid url in docs (#93)

    * fix: change the invalid url in docs
    
    * fix: address the comment
    ashleyxuu authored Oct 7, 2023
    Copy the full SHA
    969800d View commit details

Commits on Oct 9, 2023

  1. chore(deps): bump urllib3 from 1.26.12 to 1.26.17 in /.kokoro (#96)

    Source-Link: googleapis/synthtool@0c7b033
    Post-Processor: gcr.io/cloud-devrel-public-resources/owlbot-python:latest@sha256:08e34975760f002746b1d8c86fdc90660be45945ee6d9db914d1508acdf9a547
    
    Co-authored-by: Owl Bot <gcf-owl-bot[bot]@users.noreply.github.com>
    gcf-owl-bot[bot] and gcf-owl-bot[bot] authored Oct 9, 2023
    Copy the full SHA
    5b0ee39 View commit details

Commits on Oct 10, 2023

  1. docs: add more preprocessing models into the docs menu. (#97)

    * docs: add more preprocessing models into the docs menu.
    
    * Trigger kokoro rebuild
    ashleyxuu authored Oct 10, 2023
    Copy the full SHA
    1592315 View commit details
  2. refactor: refactor the directory of the notebooks (#89)

    Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:
    - [ ] Make sure to open an issue as a [bug/issue](https://togithub.com/googleapis/python-bigquery-dataframes/issues/new/choose) before writing your code!  That way we can discuss the change, evaluate designs, and agree on the general idea
    - [ ] Ensure the tests and linter pass
    - [ ] Code coverage does not decrease (if any source code was changed)
    - [ ] Appropriate docs were updated (if necessary)
    
    Fixes #<issue_number_goes_here> 🦕
    ashleyxuu authored Oct 10, 2023
    Copy the full SHA
    e1ee09b View commit details
  3. feat: add iat and iloc accessing by tuples of integers (#90)

    * feat: add iat indexers
    
    * select_columns correction
    
    * add iloc for tuples
    
    * add dataframe tests
    
    * add series tests
    
    * add third party comments
    
    ---------
    
    Co-authored-by: Henry J Solberg <henryjsolberg@google.com>
    milkshakeiii and Henry J Solberg authored Oct 10, 2023
    Copy the full SHA
    228aeba View commit details
  4. feat: add level param to DataFrame.stack (#88)

    * feat: add level param to DataFrame.stack
    TrevorBergeron authored Oct 10, 2023
    Copy the full SHA
    97b8bec View commit details

Commits on Oct 11, 2023

  1. Copy the full SHA
    4ae4ef9 View commit details
  2. chore(main): release 0.7.0 (#86)

    Co-authored-by: release-please[bot] <55107282+release-please[bot]@users.noreply.github.com>
    release-please[bot] authored Oct 11, 2023
    Copy the full SHA
    d39134d View commit details
Showing with 1,179 additions and 567 deletions.
  1. +2 −2 .github/.OwlBot.lock.yaml
  2. +3 −3 .kokoro/requirements.txt
  3. +22 −0 CHANGELOG.md
  4. +3 −1 bigframes/_config/bigquery_options.py
  5. +24 −0 bigframes/clients.py
  6. +90 −155 bigframes/core/__init__.py
  7. +33 −0 bigframes/core/block_transforms.py
  8. +92 −36 bigframes/core/blocks.py
  9. +0 −4 bigframes/core/groupby/__init__.py
  10. +44 −0 bigframes/core/indexers.py
  11. +1 −3 bigframes/core/indexes/index.py
  12. +8 −8 bigframes/core/joins/row_identity.py
  13. +12 −10 bigframes/core/joins/single_column.py
  14. +74 −18 bigframes/dataframe.py
  15. +2 −1 bigframes/ml/base.py
  16. +19 −4 bigframes/ml/llm.py
  17. +10 −0 bigframes/operations/__init__.py
  18. +0 −6 bigframes/operations/base.py
  19. +30 −44 bigframes/remote_function.py
  20. +32 −1 bigframes/series.py
  21. +5 −1 bigframes/session.py
  22. +1 −1 bigframes/version.py
  23. +10 −2 docs/templates/toc.yml
  24. +196 −196 notebooks/{getting_started → generative_ai}/bq_dataframes_llm_code_generation.ipynb
  25. +3 −3 notebooks/{getting_started → regression}/bq_dataframes_ml_linear_regression.ipynb
  26. +2 −2 noxfile.py
  27. +36 −1 tests/system/small/ml/test_llm.py
  28. +180 −2 tests/system/small/test_dataframe.py
  29. +25 −13 tests/system/small/test_multiindex.py
  30. +31 −0 tests/system/small/test_remote_function.py
  31. +52 −0 tests/system/small/test_series.py
  32. +57 −0 tests/unit/test_clients.py
  33. +8 −34 tests/unit/test_core.py
  34. +32 −0 third_party/bigframes_vendored/pandas/core/frame.py
  35. +24 −0 third_party/bigframes_vendored/pandas/core/series.py
  36. +1 −1 third_party/bigframes_vendored/sklearn/preprocessing/_data.py
  37. +15 −15 third_party/bigframes_vendored/sklearn/preprocessing/_encoder.py
4 changes: 2 additions & 2 deletions .github/.OwlBot.lock.yaml
Original file line number Diff line number Diff line change
@@ -13,5 +13,5 @@
# limitations under the License.
docker:
image: gcr.io/cloud-devrel-public-resources/owlbot-python:latest
digest: sha256:fac304457974bb530cc5396abd4ab25d26a469cd3bc97cbfb18c8d4324c584eb
# created: 2023-10-02T21:31:03.517640371Z
digest: sha256:08e34975760f002746b1d8c86fdc90660be45945ee6d9db914d1508acdf9a547
# created: 2023-10-09T14:06:13.397766266Z
6 changes: 3 additions & 3 deletions .kokoro/requirements.txt
Original file line number Diff line number Diff line change
@@ -467,9 +467,9 @@ typing-extensions==4.4.0 \
--hash=sha256:1511434bb92bf8dd198c12b1cc812e800d4181cfcb867674e0f8279cc93087aa \
--hash=sha256:16fa4864408f655d35ec496218b85f79b3437c829e93320c7c9215ccfd92489e
# via -r requirements.in
urllib3==1.26.12 \
--hash=sha256:3fa96cf423e6987997fc326ae8df396db2a8b7c667747d47ddd8ecba91f4a74e \
--hash=sha256:b930dd878d5a8afb066a637fbb35144fe7901e3b209d1cd4f524bd0e9deee997
urllib3==1.26.17 \
--hash=sha256:24d6a242c28d29af46c3fae832c36db3bbebcc533dd1bb549172cd739c82df21 \
--hash=sha256:94a757d178c9be92ef5539b8840d48dc9cf1b2709c9d6b588232a055c524458b
# via
# requests
# twine
22 changes: 22 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -4,6 +4,28 @@

[1]: https://pypi.org/project/bigframes/#history

## [0.7.0](https://github.com/googleapis/python-bigquery-dataframes/compare/v0.6.0...v0.7.0) (2023-10-11)


### Features

* Add aliases for several series properties ([#80](https://github.com/googleapis/python-bigquery-dataframes/issues/80)) ([c0efec8](https://github.com/googleapis/python-bigquery-dataframes/commit/c0efec8956198247b27904345a795f09c80d3502))
* Add equals methods to series/dataframe ([#76](https://github.com/googleapis/python-bigquery-dataframes/issues/76)) ([636a209](https://github.com/googleapis/python-bigquery-dataframes/commit/636a209e0853501abd50784a11a87cf7f2282ee5))
* Add iat and iloc accessing by tuples of integers ([#90](https://github.com/googleapis/python-bigquery-dataframes/issues/90)) ([228aeba](https://github.com/googleapis/python-bigquery-dataframes/commit/228aeba09782ae2421040c7601c15d4af92790b6))
* Add level param to DataFrame.stack ([#88](https://github.com/googleapis/python-bigquery-dataframes/issues/88)) ([97b8bec](https://github.com/googleapis/python-bigquery-dataframes/commit/97b8bec1175499c74448a4fd46b4888c4b4c35c1))
* Allow df.drop to take an index object ([#68](https://github.com/googleapis/python-bigquery-dataframes/issues/68)) ([740c451](https://github.com/googleapis/python-bigquery-dataframes/commit/740c45176f79d4d2f7f28cb5f6c9eeb1327c8397))
* Use default session connection ([#87](https://github.com/googleapis/python-bigquery-dataframes/issues/87)) ([4ae4ef9](https://github.com/googleapis/python-bigquery-dataframes/commit/4ae4ef995348b95521c4988a8cfb3b5ac792fd69))


### Bug Fixes

* Change the invalid url in docs ([#93](https://github.com/googleapis/python-bigquery-dataframes/issues/93)) ([969800d](https://github.com/googleapis/python-bigquery-dataframes/commit/969800d669204de4d0f2e5e61da521217e55668b))


### Documentation

* Add more preprocessing models into the docs menu. ([#97](https://github.com/googleapis/python-bigquery-dataframes/issues/97)) ([1592315](https://github.com/googleapis/python-bigquery-dataframes/commit/159231505f339173560cd802dae3fed3e63a663b))

## [0.6.0](https://github.com/googleapis/python-bigquery-dataframes/compare/v0.5.0...v0.6.0) (2023-10-04)


4 changes: 3 additions & 1 deletion bigframes/_config/bigquery_options.py
Original file line number Diff line number Diff line change
@@ -83,12 +83,14 @@ def project(self, value: Optional[str]):

@property
def bq_connection(self) -> Optional[str]:
"""Name of the BigQuery connection to use.
"""Name of the BigQuery connection to use. Should be of the form <PROJECT_NUMBER/PROJECT_ID>.<LOCATION>.<CONNECTION_ID>.
You should either have the connection already created in the
<code>location</code> you have chosen, or you should have the Project IAM
Admin role to enable the service to create the connection for you if you
need it.
If this option isn't provided, or project or location aren't provided, session will use its default project/location/connection_id as default connection.
"""
return self._bq_connection

24 changes: 24 additions & 0 deletions bigframes/clients.py
Original file line number Diff line number Diff line change
@@ -29,6 +29,8 @@
)
logger = logging.getLogger(__name__)

_BIGFRAMES_DEFAULT_CONNECTION_ID = "bigframes-default-connection"


class BqConnectionManager:
"""Manager to handle operations with BQ connections."""
@@ -162,3 +164,25 @@ def _get_service_account_if_connection_exists(
pass

return service_account


def get_connection_name_full(
connection_name: Optional[str], default_project: str, default_location: str
) -> str:
"""Retrieve the full connection name of the form <PROJECT_NUMBER/PROJECT_ID>.<LOCATION>.<CONNECTION_ID>.
Use default project, location or connection_id when any of them are missing."""
if connection_name is None:
return (
f"{default_project}.{default_location}.{_BIGFRAMES_DEFAULT_CONNECTION_ID}"
)

if connection_name.count(".") == 2:
return connection_name

if connection_name.count(".") == 1:
return f"{default_project}.{connection_name}"

if connection_name.count(".") == 0:
return f"{default_project}.{default_location}.{connection_name}"

raise ValueError(f"Invalid connection name format: {connection_name}.")
Loading