How should you improve the performance of your model?

You work on a regression problem in a natural language processing domain, and you have 100M labeled examples in your dataset. You have randomly shuffled your data and split your dataset into train and test samples (in a …

Which cloud-native service should you use to orchestrate the entire pipeline?

Your company has a hybrid cloud initiative. You have a complex data pipeline that moves data between cloud provider services and leverages services from each of the cloud providers. Which cloud-native service should you use to …

How should you migrate this data to Cloud Storage?

You need to move 2 PB of historical data from an on-premises storage appliance to Cloud Storage within six months, and your outbound network capacity is constrained to 20 Mb/sec. How should you migrate this data to Cloud …
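
As a rough sanity check on the numbers in this question, the sketch below estimates how long the stated link would need to move the data. It assumes decimal units (1 PB = 10^15 bytes), reads "Mb/sec" as megabits per second, and assumes the link is fully and continuously used; the function name `transfer_days` is illustrative only.

```python
# Back-of-the-envelope transfer-time estimate (assumptions: decimal units,
# "Mb/sec" means megabits per second, 100% sustained link utilization).

SECONDS_PER_DAY = 86_400
PETABYTE = 10**15   # bytes, decimal definition (assumption)
MEGABIT = 10**6     # bits


def transfer_days(size_bytes: int, link_bits_per_sec: int) -> float:
    """Return the number of days needed to send size_bytes over the link."""
    total_bits = size_bytes * 8
    seconds = total_bits / link_bits_per_sec
    return seconds / SECONDS_PER_DAY


days = transfer_days(2 * PETABYTE, 20 * MEGABIT)
years = days / 365

print(f"{days:.0f} days (about {years:.1f} years)")
```

Under these assumptions the network transfer alone would take on the order of 9,000 days, far longer than a six-month window of roughly 180 days.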

How should you design data storage for this solution?

You are designing a cloud-native historical data processing system to meet the following conditions: The data being analyzed is in CSV, Avro, and PDF formats and will be accessed by multiple analysis tools including Cloud Dataproc, BigQuery, and Compute Engine. …

Which Stackdriver alerts should you create?

You are operating a Cloud Dataflow streaming pipeline. The pipeline aggregates events from a Cloud Pub/Sub subscription source, within a window, and sinks the resulting aggregation to a Cloud Storage bucket. The source has consistent throughput. You want to monitor an …

What should you do to improve the performance of your application?

You operate a database that stores stock trades and an application that retrieves the average stock price for a given company over an adjustable window of time. The data is stored in Cloud Bigtable where the datetime of …

Which strategy should you choose?

You need to create a new transaction table in Cloud Spanner that stores product sales data. You are deciding what to use as a primary key. From a performance perspective, which strategy should you choose? Reference: https://www.uuidgenerator.net/version4. Options: The current epoch time; A concatenation …

How should you organize your data in BigQuery and store your backups?

You use BigQuery as your centralized analytics platform. New data is loaded every day, and an ETL pipeline modifies the original data and prepares it for the final users. This ETL pipeline is regularly modified …

How should you create the ML pipeline?

A data scientist has created a BigQuery ML model and asks you to create an ML pipeline to serve predictions. You have a REST API application with the requirement to serve predictions for an individual user ID with latency under …

Which solution should you choose?

You use a dataset in BigQuery for analysis. You want to provide third-party companies with access to the same dataset. You need to keep the costs of data sharing low and ensure that the data is current. Which solution should you choose? …